/********************************************************************
**                Image Component Library (ICL)                    **
**                                                                 **
** Copyright (C) 2006-2013 CITEC, University of Bielefeld          **
**                         Neuroinformatics Group                  **
** Website: www.iclcv.org and                                      **
**          http://opensource.cit-ec.de/projects/icl               **
**                                                                 **
** File   : ICLCV/src/ICLCV/PositionTracker.h                      **
** Module : ICLCV                                                  **
** Authors: Christof Elbrechter                                    **
**                                                                 **
**                                                                 **
** GNU LESSER GENERAL PUBLIC LICENSE                               **
** This file may be used under the terms of the GNU Lesser General **
** Public License version 3.0 as published by the                  **
**                                                                 **
** Free Software Foundation and appearing in the file LICENSE.LGPL **
** included in the packaging of this file.  Please review the      **
** following information to ensure the license requirements will   **
** be met: http://www.gnu.org/licenses/lgpl-3.0.txt                **
**                                                                 **
** The development of this software was supported by the           **
** Excellence Cluster EXC 277 Cognitive Interaction Technology.    **
** The Excellence Cluster EXC 277 is a grant of the Deutsche       **
** Forschungsgemeinschaft (DFG) in the context of the German       **
** Excellence Initiative.                                          **
**                                                                 **
********************************************************************/

#pragma once
#include <vector>
#include <deque>
#include <ICLUtils/Point32f.h>

namespace icl{
  namespace cv{
  
    /// How should new ID's be allocated
    enum IDAllocationMode{
      allocateFirstFreeIDs, // re-using old ID's
      allocateBrandNewIDs   // using next brand-new ID
    };
  
    
    /// Class for tracking 2D positions 
    /** 
    \section INTRO Introduction
    The PositionTracker class provides functionalities for tracking 2D data points. Consider
    the output of a standard blob-detection algorithm: A list of 2D Positions of all found blobs.
    In a special case these positions are not labeled, e.g. because the used blob-detection-algorithm
    does not provide any kind of "tracking-mechanism" that pursues each single blob from one 
    time step to the next. 
    But although having only a list of firstly unassociated "2DPositions-snapshots", it is possible
    to track each blob very efficiently by proceeding form the assumption of slow or continously 
    moving blobs.
    A simple brute-force approach would assign a blob at time t to the closest blob at time t-1.
    If the tracked blobs will move slow, and there is a margin between each blob that is <em>large
    enough</em>, this approach would give us good results at a complexity of O(\f$n^2\f$). 
    But what happens if one of the assumptions was false? The answer is, that the algorithm
    will calculate false assignments (E.g. if a blob moves fast enough, to be closer to another
    blob at at time t-1, the blobs IDs may be swapped).\n
    In addition to this, two cases must be tackled in a special way: What if some existing blobs
    are lost by the blob detector? And what if new blobs are found, that have no corresponding 
    blob at time t-1?
    
    \section ALGO Adaption to the "Brute force"-Algorithm to avoid false assignments
    The first adaption is quite obvious: Instead of expecting a blobs position at time t
    at its position at time t-1, we can predict its position regarding its history (positions at
    time t-1, t-2, ...). This will allow the blobs to move or accelerate (<b>but note:</b> an 
    extrapolation of the blobs position at time t implies an implicit model-assumption, 
    which is e.g. a quadratic one in case of regarding up to 3 former positions for each blob).\n
    Another problem, not mentioned above, is the special case that at time t a single blob 
    (or its extrapolated position for time t) is the nearest of more than one blob at time t-1.
    In this case, we have to decide which blob is assigned to the nearest, and we have to search
    another good match for the other one. This problem can be made arbitrary sophisticated, by
    factoring in further questions like "What if the next good match is already assigned to 
    third blob". So what we need is a general approach to minimize a kind of <em>costs</em> by
    assigning blobs at time t-1 (or their prediction for time t) to blobs at time t, which leads
    to the so called (well known) problem-class of the <b>Linear Assignment Problems</b>.\n
    
    \section LAP Linear Assignment Problems
    The "Assignment Problem class" is defined as follows:
    "There are a number of agents and a number of tasks. Any agent can be assigned to perform any 
    task, incurring some cost that may vary depending on the agent-task assignment. It is required
    to perform all tasks by assigning exactly one agent to each task in such a way that the total
    cost of the assignment is minimized" (Wikipedia, http://en.wikipedia.org/wiki/Assignment_problem).\n
    An additional restriction leads to the class of <b>Linear</b> Assignment Problems:
    "If the numbers of agents and tasks are equal and the total cost of the assignment for all tasks
    is equal to the sum of the costs for each agent (or the sum of the costs for each task, which is 
    the same thing in this case), then the problem is called the Linear assignment problem. Commonly,
    when speaking of the Assignment problem without any additional qualification, then the Linear 
    assignment problem is meant"(Wikipedia).\n
  
    \section HA The "Hungarian Algorithm" for solving linear assignment Problems
    Linear assignment problems can be solved using the Hungarian Algorithm with a complexity
    of O(\f$n^3\f$), where n is the number of tasks as well as the number of agents. The Hungarian
    Algorithm is well analyzed and should not be described further here. More information is available
    e.g. at http://en.wikipedia.org/wiki/Hungarian_algorithm.\n
    We finish this section retaining that the Hungarian Algorithm gets a NxN cost matrix C, where 
    C(i,j) are the costs arising if blob i at t is assigned to blob j at time t-1 (or its pred.;s.o.),
    and it returns the N-dimensional assignment vector a, with a(i)=index of blob at time t-1 that is
    assigned to current blob i.
    
    \section PROB Problems
    As mentioned in sec. \ref INTRO, we are confronted with additional compounding problems:
    - New Blobs are detected, that did not exist before
    - Blobs that still existed at time t-1 are lost at time t
    
    To make use of the Hungarian Algorithm as described in sec. \ref HA, we need a quadratic cost
    matrix, which can be emulated by inserting so called <em>blind values</em>. This values should be
    as different as possible from all other blob positions (e.g. MAX_INT). If blobs are lost since 
    time t-1, we can fill up the vector of new blob positions with blind values, which has the result,
    that these indices are assigned to exactly these old blobs, that match worst to any new blob.
    In the other case, where more blobs have been found at time t as recorded at time t-1, we can
    fill up the vector that contains the predictions for the blobs at time t with blind values. This
    makes the Hungarian Algorithm assigning only <em>brand new</em> blobs to these blind ones.
  
    \section DM PositionTracker Data-Management
    The following ASCII illustration shows a sketch of the internally hold data:
    <pre>
    data matrix scheme: | . . . . N |      (the "."-elements do not exist)
                        | I G H P D |
    
    
    where H is the history matrix:     H = [ x(t-3) x(t-2) x(t-1) ]  and the x(T) are column vectors
                                                                     of all blob positions at time
                                                                     T, and each row r of H contains
                                                                     the history values of one blob.
  
    P is the prediction vector for time t     and P(r) is the prediction result of the r-th row of the 
                                              History matrix H
  
    N is the vector of new data positions   
    
    D is the cost matrix                      where D(c,r) is the square root of the 
                                              euclidian distance of P(r) and N(c)
                                              We have to use the square root and not the euclidian
                                              distnace itself, to avoid that new blobs are
                                              are mixed up with old ones (...)
    
    G is the so called "good count matrix"    where G(r) is the number of valid history steps is the
                                              r-th row of H
  
    I is the ID vector                        where I(r) hold a unique blob ID associated with the
                                              blob thats history complies the r-th row of H
  
    
    detailed scheme:
                                        new data (N)
                                  | o   o  x(t)  o   o
        --------------------------+-----------------------
           o      o     o      o  | d   d   d    d   d      
                                  |
           o      o     o      o  | d   ...                 
                                  |
        x(t-3) x(t-2) x(t-1) x~(t)|     distances 
                                  |         (D)
           o      o     o      o  |
                                  |
           o      o     o      o  |
                               .  |
               History        / \ |  
                 (H)           |
                           Prediction
                              (P)
    </pre>
    
    \section ALGO PositionTracker Algorithm
    All the work is done by calling the pushData function of a PositionTracker object. During the fist call
    the algorithm does not need to solve any assignment problem, so the H(t-1) is set to the new data vector
    N, the ID-vector is initialized with unique IDs at each position G(i)=i and the good count vector G is
    set up with 1 at each position.\n
    Now a call to pushData can be described using the following pseudo code:
    <pre>
    void pushData( vector newData){
      DIFF = currentDataCount - newData.dim()
      if(DIFF <  0){
          push_data_diff_lower_then_zero(...)
      }else if(DIFF > 0){
          push_data_diff_greater_then_zero(...)
      }else{
        push_data_diff_equal_zero(...)
      }
    }
  
    push_data_diff_lower_then_zero(...){
       > add DIFF blind values to H(t-1)
       > add DIFF ones to G
       > calculate prediction vector P
       > remove DIFF ones from G
       > calculate the distance matrix D 
       > apply the Hungarian algorithm with resulting assignment vector a
       > create a vector v_tmp containing only the new data points, that have been assigned to blind values
       > assign the DIFF last rows of H (which were set to blind value) to this new values in
       > give each of these rows a brand new id in I
       > assign the good count vector at these rows to 1, increment all other good counts by one
       > rearrange the new data using the assignment a
       > push the rearranged new data to H from right, and pop the left-most column of H
    }
    
    push_data_diff_greater_then_zero(...){
       > add DIFF blind values to the new data vector 
       > calculate prediction vector P
       > calculate the distance matrix D 
       > apply the Hungarian algorithm with resulting assignment vector a
       > rearrange the new data using the assignment a
       > push the rearranged new data to H from right, and pop the left-most column of H
       > create a vector v_del containing all rows, that have been assigned to blind values
       > use v_del to remove no longer used rows form H, I and G
       > increment all remaining elements of G
    }
    
    push_data_diff_equal_zero(...){
       > calculate prediction vector P
       > calculate the distance matrix D 
       > apply the Hungarian algorithm with resulting assignment vector a
       > rearrange the new data using the assignment a
       > increment all elements of G
    }
    </pre>
   
    \section PERFORMANCE Performance
    The Hungarian algorithm has a worst case complexity of O(\f$n^3\f$), which gives the algorithm a
    poor performance when the Blob count is about 100 or more. The \f$n^3\f$ grows very fast: 20 Blobs
    can be tracked in about one milli second, 200 Blobs already need about 200ms. The following
    two diagrams illustrate the performance:
    \image html bench1.jpg "Performance for 2-20 Blobs" width=4cm
    \image html bench2.jpg "Performance for 0-500 Blobs (the green line is show an O(n^2) approximation, the red on is O(n^3)" width=4cm
  
  
        \section OPT_ Optimization
        If the optimized flag and a valid threshold is given to the constructor, the pushData function is implemented as follows:
        Because in real applications, there are many successive time steps, where pushData gets the nearly equal data, pushData
        trys to associate old and new data with a trivial min-distance matching: If the data dimension has not changed, for each 
        old data item (without extrapolation), the nearest new data item is chosen. If no conflicts arise (one old center is the
        nearest to more then one new center and if all minimum distances are below the given threshold, this trivial assignment is
        used. Otherwise the default algorithm is applied, and the optimization has no effect. <b>Note:</b> If the given threshold
        is smaller or equal to zero or the data dimension changes from on push call to another, no optimization is performed.
    */
  
    template<class valueType>
    class ICLCV_API PositionTracker{
      public:
      
     
      
  
      /// Empty default constructor without any optimization
      PositionTracker():m_bTryOptimize(false),m_currentID(0),m_IDAllocationMode(allocateFirstFreeIDs),m_tThreshold(0){}
  
      /// *NEW* constructor with optimization enabled and given theshold
      /** @param threshold threshold for optimization (must be > 0) \ref OPT_ */
      PositionTracker(valueType threshold):
        m_bTryOptimize(true),m_currentID(0),
        m_IDAllocationMode(allocateFirstFreeIDs),m_tThreshold(threshold){}
      
      /// most common function, adds a new data row, and causes all internal computation (see above)
      /** @param xys data vector with xyxy.. data order 
          @param n count of xy-pairs in xys (xys.length/2)*/
      void pushData(valueType *xys, int n);
  
      /// as above
      void pushData(const std::vector<valueType> &xs, const std::vector<valueType> &ys);
  
      /// as above
      void pushData(const std::vector<utils::Point32f> points);
      
      /** Switch between continuous and restocking ID allocation. Restocking means, that a new object 
          will get the smallest avialable ID (formerly used by an older object which is no longer existing), instead of a brand new.
          @param mode on if true IDs will we be allocated continuously
      */
      void setIDAllocationMode(IDAllocationMode mode){
        m_IDAllocationMode = mode;
      }
  
      /// returns the unique id of a just pushe data point (x,y)
      /** A problem occurs, if more than on point with coordinates (x,y) was 
          pushed, in this case, this function will return the first found one.
          To avoid this, the next function (getID(int index)) can be used to
          specify the datapoint by its index 
      */
      int getID(valueType x, valueType y);
      
      /// returns the unique id of just pushed data point with given index
      int getID(int index);
  
      private:
      
      /// internally used vector type
      typedef std::vector<valueType> Vec;
  
      /// internally used matrix type
      typedef std::vector<Vec> Mat;
      
      /// internally used queue-matrix (columns can be pushed and pop'ed in constant time
      typedef std::deque<Vec> QMat;
  
      /// internal storage of the history matrix H
      QMat m_matData[2];
      
      /// internal storage of the last calculated assignment
      std::vector<int> m_vecCurrentAssignment;
      
      /// internal storage of the unique ids I
      std::vector<int> m_vecIDs;
      
      /// internal storage for the good data count G
      std::vector<int> m_vecGoodDataCount;
      
      /// flag to indicate whether to try optimization (trivial assignment)
      bool m_bTryOptimize;
  
      /// first unused ID
      int m_currentID;
  
      /// flag to indicate which type of ID allocation should be used    
      IDAllocationMode m_IDAllocationMode;
  
      /// threshold distance
      valueType m_tThreshold;
    };
    
    
  } // namespace cv
}