Development of a next-generation three-dimensional audio-visual speech recognition system. To overcome the disadvantages of current audio-visual speech recognition systems, we propose a set of robust algorithms for three-dimensional computer vision and speech processing. The proposed system is expected to benefit areas such as human-machine interaction, in particular speech recognition in automated dialog systems and voice-to-text conversion.