This book constitutes the thoroughly refereed joint post-workshop proceedings of two co-located events: the Second International Workshop on Classification of Events, Activities and Relationships, CLEAR 2007, and the 5th Rich Transcription 2007 Meeting Recognition evaluation, RT 2007, held in succession in Baltimore, MD, USA, in May 2007.
The two workshops ran complementary evaluation efforts: CLEAR for the evaluation of human activities, events, and relationships in multiple multimodal data domains, and RT for the evaluation of speech transcription-related technologies on meeting room audio collections. The 35 revised full papers presented from CLEAR 2007 cover 3D person tracking, 2D face detection and tracking, person and vehicle tracking on surveillance data, vehicle and person tracking in aerial videos, person identification, head pose estimation, and acoustic event detection. The 15 revised full papers presented from RT 2007 are organized in topical sections on speech-to-text and speaker diarization.
Author(s): Rainer Stiefelhagen, Keni Bernardin, Rachel Bowers, R. Travis Rose, Martial Michel (auth.), Rainer Stiefelhagen, Rachel Bowers, Jonathan Fiscus (eds.)
Series: Lecture Notes in Computer Science 4625
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 2008
Language: English
Pages: 558
Tags: Pattern Recognition; Image Processing and Computer Vision; Artificial Intelligence (incl. Robotics); Computer Graphics; Biometrics; Algorithm Analysis and Problem Complexity
Front Matter....Pages -
Front Matter....Pages 1-1
The CLEAR 2007 Evaluation....Pages 3-34
The AIT 3D Audio / Visual Person Tracker for CLEAR 2007....Pages 35-46
A Person Tracking System for CHIL Meetings....Pages 47-56
An Appearance-Based Particle Filter for Visual Tracking in Smart Rooms....Pages 57-69
Multi-level Particle Filter Fusion of Features and Cues for Audio-Visual Person Tracking....Pages 70-81
Multispeaker Localization and Tracking in Intelligent Environments....Pages 82-90
Multi-person Tracking Strategies Based on Voxel Analysis....Pages 91-103
TUT Acoustic Source Tracking System 2007....Pages 104-112
The AIT 2D Face Detection and Tracking System for CLEAR 2007....Pages 113-125
PittPatt Face Detection and Tracking for the CLEAR 2007 Evaluation....Pages 126-137
Tsinghua Face Detection and Tracking for CLEAR 2007 Evaluation....Pages 138-147
The AIT Outdoor Tracker for Vehicles and Pedestrians in CLEAR2007....Pages 148-159
Objective Evaluation of Pedestrian and Vehicle Tracking on the CLEAR Surveillance Dataset....Pages 160-173
Person and Vehicle Tracking in Surveillance Video....Pages 174-178
UMD_VDT, an Integration of Detection and Tracking Methods for Multiple Human Tracking....Pages 179-190
CLEAR’07 Evaluation of USC Human Tracking System for Surveillance Videos....Pages 191-196
Speed Performance Improvement of Vehicle Blob Tracking System....Pages 197-202
Vehicle and Person Tracking in Aerial Videos....Pages 203-214
Person Tracking in UAV Video....Pages 215-220
The AIT Multimodal Person Identification System for CLEAR 2007....Pages 221-232
Acoustic Speaker Identification: The LIMSI CLEAR’07 System....Pages 233-239
MIT Lincoln Laboratory Multimodal Person Identification System in the CLEAR 2007 Evaluation....Pages 240-247
Multichannel and Multimodality Person Identification....Pages 248-255
ISL Person Identification Systems in the CLEAR 2007 Evaluations....Pages 256-265
Robust Speaker Identification for Meetings: UPC CLEAR’07 Meeting Room Evaluation System....Pages 266-275
Probabilistic Head Pose Tracking Evaluation in Single and Multiple Camera Setups....Pages 276-286
Joint Bayesian Tracking of Head Location and Pose from Low-Resolution Video....Pages 287-296
Learning a Person-Independent Representation for Precise 3D Pose Estimation....Pages 297-306
Head Pose Estimation in Single- and Multi-view Environments - Results on the CLEAR’07 Benchmarks....Pages 307-316
Head Orientation Estimation Using Particle Filtering in Multiview Scenarios....Pages 317-327
The Acoustic Event Detector of AIT....Pages 328-337
An HMM Based System for Acoustic Event Detection....Pages 338-344
HMM-Based Acoustic Event Detection with AdaBoost Feature Selection....Pages 345-353
Acoustic Event Detection: SVM-Based System and Evaluation Setup in CLEAR’07....Pages 354-363
TUT Acoustic Event Detection System 2007....Pages 364-370
Front Matter....Pages 371-371
The Rich Transcription 2007 Meeting Recognition Evaluation....Pages 373-389
The CHIL RT07 Evaluation Data....Pages 390-400
Shared Linguistic Resources for the Meeting Domain....Pages 401-413
The 2007 AMI(DA) System for Meeting Transcription....Pages 414-428
The IBM Rich Transcription 2007 Speech-to-Text Systems for Lecture Meetings....Pages 429-441
The LIMSI RT07 Lecture Transcription System....Pages 442-449
The SRI-ICSI Spring 2007 Meeting and Lecture Recognition System....Pages 450-463
The ISL RT-07 Speech-to-Text System....Pages 464-474
Progress in the AMIDA Speaker Diarization System for Meeting Data....Pages 475-483
Speaker Diarization Using Direction of Arrival Estimate and Acoustic Feature Information: The I²R-NTU Submission for the NIST RT 2007 Evaluation....Pages 484-496
The IBM RT07 Evaluation Systems for Speaker Diarization on Lecture Meetings....Pages 497-508
The ICSI RT07s Speaker Diarization System....Pages 509-519
The LIA RT’07 Speaker Diarization System....Pages 520-532
Multi-stage Speaker Diarization for Conference and Lecture Meetings....Pages 533-542
Speaker Diarization for Conference Room: The UPC RT07s Evaluation System....Pages 543-553
Back Matter....Pages -