lip reading systems & proposed joint audio visual speech processing system
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Active In SP

Posts: 2
Joined: Feb 2010
28-02-2010, 10:44 AM

hi plz mail me seminar and presentation report of this topic , i am in urgent need of this.
Active In SP

Posts: 1
Joined: Apr 2010
01-04-2010, 08:02 PM

hi sho can u send me ppt about lip reading and its proposed joints to my email you very much
Active In SP

Posts: 291
Joined: Apr 2010
03-04-2010, 08:49 AM

Joint Audio-Visual Speech Processing
Visual speech information present in the speakerâ„¢s mouth region
has long been viewed as a source for improving the robustness
and naturalness of human-computer-interfaces . where the acoustic channel is corrupted, the automatic speech recognition
(ASR) systems falls below usability and this system comes into use here.

Human speech is by nature bimodal, both in its production and
perception. humans integrate audio
and visual stimuli to perceive speech. Researchs have been going on the integration of the visual modality into the speech channel of the human-computerinterface (HCI), aiming in improving its robustness and naturalness. the visual channel can benefit processes such as speaker identification, verification, localization, speech event detection , speech signal separation , coding , video indexing and retrieval , and text-to-speech.

The Visual Front End:
Visual speech features generally fit into one of the following
three categories:
a)Appearance based features: assume that all video pixels
within a region-of-interest (ROI) are informative about the
spoken utterance.
b) shape based ones:assumes that most speechreading information
is contained in the contours of the speakerâ„¢s lips, or more generally,
of the face
c) or combination of both.

Audio-visual features in our system:
The system used here produces appearance
based features and operates on full face video with no artificial
face markings due to which both face detection and ROI
extraction are required. Tracking provides the mouth location, size, and orientation,
which are then smoothed over time to improve robustness.a 6464 pixel ROI is obtained
for every video frame Based on the resulting estimates. a two-dimensional, separable discrete cosine
transform (DCT) is applied to the ROI, and the 100 highest energy
DCT coefficients are retained. Then an intraframe
linear discriminant analysis (LDA) project and implimentationion is applied To reduce dimensionality which resulting in a 30-dimensional feature vector. Then a a maximum likelihood linear transformation (MLLT) is applied that improves maximum likelihood based statistical data modelling.

Use Search at wisely To Get Information About Project Topic and Seminar ideas with report/source code along pdf and ppt presenaion
Active In SP

Posts: 1
Joined: Jul 2011
29-10-2011, 09:29 AM

hi plz mail me the full report and the ppt.

Important Note..!

If you are not satisfied with above reply ,..Please


So that we will collect data for you and will made reply to the request....OR try below "QUICK REPLY" box to add a reply to this page

Quick Reply
Type your reply to this message here.

Image Verification
Please enter the text contained within the image into the text box below it. This process is used to prevent automated spam bots.
Image Verification
(case insensitive)

Possibly Related Threads...
Thread Author Replies Views Last Post
  speech amplifier in a walkie talkie and working of each part Guest 1 181 31-10-2016, 12:41 PM
Last Post: anusree
  a speech on environmental pollution in malayalam language Guest 1 130 31-10-2016, 12:40 PM
Last Post: anusree
  halftone visual cryptography using matlab Guest 1 71 31-10-2016, 11:36 AM
Last Post: ijasti
  explanation of the circuit for a speech amplifier in a walkie talkie Guest 1 123 31-10-2016, 11:26 AM
Last Post: anusree
  control systems by nagoor kani notes pdf Guest 1 106 31-10-2016, 10:42 AM
Last Post: amrutha735
  gib and cotter joint ppt Guest 1 61 31-10-2016, 10:33 AM
Last Post: jaseela123
  free download of mind reading computer documentation Guest 1 96 31-10-2016, 10:33 AM
Last Post: anusree
  seminar reports on grid connected pv systems Guest 1 63 31-10-2016, 10:06 AM
Last Post: anusree
  5 pen pc technology existing system and proposed system Guest 1 107 31-10-2016, 09:53 AM
Last Post: amrutha735
  vtu lecture notes speech processing Guest 1 76 31-10-2016, 09:43 AM
Last Post: amrutha735