speech processing
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Active In SP

Posts: 3
Joined: Jan 2010
26-01-2010, 01:54 PM

hi i need full details of 'independent speech recognition.'
Active In SP

Posts: 247
Joined: Jan 2010
27-01-2010, 02:53 AM

Speaker recognition methods can be divided into text-independent and text-dependent methods. In a independent system, speaker models capture characteristics of somebody's speech, which show up irrespective of what one is saying. This paper is based on independent speaker recognition system and makes use of mel frequency cepstrum coefficients to process the input signal and vector quantization approach to identify the speaker. The above task is implemented using MATLAB.
This report is available in the thread:

Text Independent Speaker Recognition
The main method of speech recognition is to decode the speech signal in a
sequential method based on the observed acoustic features of the signal and known relations between acoustic features and phonetic symbols. The pattern recognition approach to speech recognition is the one in which the speech patterns are directly used and it involves involves the training of speech patterns and recognition of patterns via pattern comparison. the machine
learns which acoustic properties of the speech class are reliable and repeatable across all training tokens of the pattern in the characterization of speech via training pattern.
The performance of the speech recognition systems is given in terms of a word error rate (%) as measured for a specified technology, for a given task, with specified task syntax, in a specified mode, and for a specified word vocabulary. Continuous speech recognition systems find applications in voice repertory
dialer where eyes free, hands free dialing of numbers is possible. Speaker recognition involves the speaker identification to output the identity of the person most likely to have spoken from among a given population or to verify a personâ„¢s identity who he/she claims to be from a given speech input. Automatic speaker verification (ASV) requires a mere comparison between test pattern and one reference template and involves a binary decision of whether to accept or reject a speaker.

Feature based on Cepstrum
The short-time speech spectrum in the case of the speech has two components:
1) harmonic
peaks due to the periodicity of voiced speech, and
2) glottal pulse shape.
periodicity of voiced speech is decided by the excitation source. The variations among speakers are
indicated by formant locations and bandwidth.

Speech recognition based on proposed features
This includes features like, training and testing
data, building VQ codebook models for all digits and continuous speeches of speakers and testing each utterance against a certain number of speech models to detect the identity of the speech of that utterance from among the speech models. The speech database used contains isolated digits from TI digits_1 and TI digits_2 in case of isolated digit recognition. This is evaluated by using digits pronounced by other speakers in the database. Speaker independent continuous speech
recognition system is evaluated on training data formed by concatenation of dialect sentences of 24 speakers and test data from 100 speakers in the TIMIT database.

Full report:

Attached Files
.pdf   Text Independent Speaker Recognition and.pdf (Size: 950.29 KB / Downloads: 115)
Use Search at http://topicideas.net/search.php wisely To Get Information About Project Topic and Seminar ideas with report/source code along pdf and ppt presenaion
seminar ideas
Super Moderator

Posts: 10,003
Joined: Apr 2012
14-07-2012, 02:18 PM

Digital Speech Processing

.pdf   Digital Speech Processing.pdf (Size: 568.22 KB / Downloads: 72)

Digital Speech Coding

Digital Speech
– Convert analog speech to digital form and transmit digitally
• Applications
– Telephony: (cellular, wired and Internet- VoIP)
– Speech Storage (Automated call-centers)
– High-Fidelity recordings/voice
– Text-to-speech (machine generated speech)
• Issues
– Efficient use of bandwidth
• Compress to lower bit rate per user => more users
– Speech Quality
• Want tollgrade or better quality in a specific transmission environment
• Environment ( BER, packet lost, packet out of order, delay, etc.)
– Hardware complexity
• Speed (coding/decoding delay), computation requirement and power

Characteristics of Speech

– Most of energy between 20 Hz
to about 7KHz ,
– Human ear sensitive to energy
between 50 Hz and 4KHz
• Time Signal
– High correlation
– Short term stationary
• Classified into four categories
– Voiced : created by air passed
through vocal cords (e.g., ah, v)
– Unvoiced : created by air
through mouth and lips
(e.g., s, f )
– Mixed or transitional
– Silence

Digital Speech

Speech Coder: device that converts speech to digital
• Types of speech coders
– Waveform coders
• Convert any analog signal to digital form
– Vocoders (Parametric coders)
• Try to exploit special properties of speech signal to reduce bit rate
• Build model of speech – transmit parameters of model
– Hybrid Coders
• Combine features of waveform and vocoders

Waveform Coders (e.g.,PCM)

Waveform Coders
• Convert any analog signal to digital -
basically A/D converter
• Analog signal sampled > twice highest
frequency- then quantized into ` n ‘ bit
• Uniform quantization
• Example Pulse Code Modulation
• band limit speech < 4000 Hz
• pass speech through μ−law compander
• sample 8000 Hz, 8 bit samples
• 64 Kbps DS0 rate
• Characteristics
• Quality – High
• Complexity – Low
• Bit rate – High
• Delay - Low
• Robustness - High

Important Note..!

If you are not satisfied with above reply ,..Please


So that we will collect data for you and will made reply to the request....OR try below "QUICK REPLY" box to add a reply to this page

Quick Reply
Type your reply to this message here.

Image Verification
Please enter the text contained within the image into the text box below it. This process is used to prevent automated spam bots.
Image Verification
(case insensitive)

Possibly Related Threads...
Thread Author Replies Views Last Post
  speech amplifier in a walkie talkie and working of each part Guest 1 185 31-10-2016, 12:41 PM
Last Post: anusree
  a speech on environmental pollution in malayalam language Guest 1 133 31-10-2016, 12:40 PM
Last Post: anusree
  explanation of the circuit for a speech amplifier in a walkie talkie Guest 1 126 31-10-2016, 11:26 AM
Last Post: anusree
  vtu lecture notes speech processing Guest 1 81 31-10-2016, 09:43 AM
Last Post: amrutha735
  thesis topics for image processing Guest 1 63 29-10-2016, 03:17 PM
Last Post: Dhanabhagya
  detection of malarial parasite in blood using image processing source code Guest 1 77 08-10-2016, 03:51 PM
Last Post: Dhanabhagya
  digital image processing author by r l rekha pdf download Guest 2 240 04-06-2016, 02:07 PM
Last Post: Dhanabhagya
  glaucoma image processing Guest 1 54 29-04-2016, 03:03 PM
Last Post: mkaasees
  solution manual for biomedical digital signal processing by willis Chutimon 1 80 25-04-2016, 04:05 PM
Last Post: mkaasees
Thumbs Down fruit sorting with image processing ppt Guest 1 67 25-04-2016, 03:50 PM
Last Post: mkaasees