Detecting and indexing moving objects for behavior analysis by video and audio interpretation