抄録
This paper reports our experiments for TRECVID 2010 task: Semantic Indexing. We present two approaches namely, Affective and Holistic. In the first approach, we have used combination of affective features from image, video and audio trained with neural network algorithm. Image features employed are color histogram and face detection from the keyframe. The number of face is also used in one of the runs. Video features include the motion activity and shot duration. Additionally, the audio power is included as feature. For the second approach, color, texture and scene features are extracted from the whole keyframe image as well as its background and saliency regions. Genetic algorithm is used to find the weight of each feature for effective combination. Then, KNN is used to propagate the annotation. We have submitted 4 runs where we distinguish the first two as affective category and the the last two as holistic ones. The summary is as follows: • kmlabGITS1-color histogram, motion, rhythm, sound and face number trained using neural network • kmlabGITS2-color histogram, motion, rhythm, sound and without face number trained using neural network • kmlabGITS3-combination of 5 image features (hsv bg, gabor, haar, gist and lab bg) using Genetic Algorithm and KNN • kmlabGITS4-combination of 5 image features (hsv, hsv bg, haar, haar roi and gist) using Genetic Algorithm and KNN.
本文言語 | English |
---|---|
出版ステータス | Published - 2010 1月 1 |
イベント | TREC Video Retrieval Evaluation, TRECVID 2010 - Gaithersburg, MD, United States 継続期間: 2010 11月 15 → 2010 11月 17 |
Conference
Conference | TREC Video Retrieval Evaluation, TRECVID 2010 |
---|---|
国/地域 | United States |
City | Gaithersburg, MD |
Period | 10/11/15 → 10/11/17 |
ASJC Scopus subject areas
- コンピュータ グラフィックスおよびコンピュータ支援設計
- コンピュータ ビジョンおよびパターン認識
- 人間とコンピュータの相互作用
- ソフトウェア