新聞中心

互動視窗

作者:何軍,陳超,馬翌倫,徐成 湖南大學 時間:2008-11-10 來源: 收藏

        圖像、聲音等媒體識別技術(shù)的發(fā)展與應用正在悄然地給人們生活帶來變革。互動視窗針對博物館文物展示等類似應用場合,以對參觀者手勢、意念行為、語音等信息的識別和處理為技術(shù)基礎(chǔ),設(shè)計一個方便、自然和人性化的互動平臺。

        作品在EC5-1719CLDNA上,針對展品和參觀者行為擴展了攝像頭、拾聲器、可控展品臺等硬件裝置;研究了手勢識別與跟蹤、人臉檢測與跟蹤、非特定人中英文連續(xù)語音識別等可用于人機交互的新方法和算法;研究并改進了基于手形結(jié)構(gòu)特征的識別算法,建立了手勢缺陷圖的判斷方法;在Linux上實現(xiàn)了具有物品展示、多媒體評論、娛樂等功能的互動平臺。

        作品還針對平臺雙核的特點,對識別算法進行并行化處理。圖像識別使用多線程的方式,對手勢識別、人臉檢測進行并行處理;手勢識別的手勢分割和識別過程采用了OpenMP優(yōu)化計算,有效地提高了系統(tǒng)的運行速度。

        全息物品展示功能可以理解參觀者意圖,讓參觀者立體而真實的觀看所展物品,隨心所欲的觀看每個細節(jié);多媒體評論功能獲取參觀者的音頻和繪制的圖像,為參觀者提供了交流平臺;對展示物品的手勢拼圖游戲豐富了物品展示的功能,增加了娛樂性。

        互動視窗中所實現(xiàn)的基于音、視頻識別的多種人機交互方式還可以有更廣闊的應用。

The development and application of image, voice and other media recognition technology are quietly to bring changes to people's lives. The production is designed for museum and some similar places. We have designed a convenient, natural and customized embedded platform based on the recognition and analyze of the gesture, behavior and voice of the visitors.
Camera, microphone, controllable platform and other hardware are added to EC5-1719CLDNA. Gesture identification and tracking, face detection and tracking, human-independent English and Chinese continuous speech recognition have been researched; improved the recognition algorithm of hand-shaped structure, gesture concave map judgments; achieved features like display of items, multimedia reviews, entertainment on Linux.

To the characteristics of dual-core platform, we optimized the recognition algorithm for parallel processing. The gesture recognition, face detection are paralleled; the gesture division and identification process of gesture recognition are optimized by OpenMP, which effectively improved the speed of system.

Holographic display feature is capable to understand the visitor’s idea and show what they want to watch; multimedia feature achieve comment on the voice and drawing, providing a platform for communication; Puzzle Game that play by hand gesture enriched the system and added entertainment.

The multi-channel interaction which implemented by image and voice recognition is able to be used in more application.

linux操作系統(tǒng)文章專題:linux操作系統(tǒng)詳解(linux不再難懂)


關(guān)鍵詞: 英特爾 嵌入式 競賽

評論


相關(guān)推薦

技術(shù)專區(qū)

關(guān)閉