WARNING: This server is unstable and will be retired in the next days. If you want to keep this forum available, please request immediately a migration on the Nabble Support forum. Forums that don't receive any migration request will be deleted forever.

 « Return to Thread: How does vad work?

How does vad work?

by Damon Casale :: Rate this Message:

| View in Thread

Some parts of this message have been removed. Learn more about Nabble's security policy.

The online documentation isn’t completely clear on how it works:

 

Voice Activity Detector. Attempts to trim silence and quiet background sounds from the ends of (fairly high resolution i.e. 16-bit, 44−48kHz) recordings of speech.

 

Options: 
Default values are shown in parenthesis. 
−t 
num (7)

The measurement level used to trigger activity detection. This might need to be changed depending on the noise level, signal level and other charactistics of the input audio.

 

What is a “measurement level”?  What does the default number 7 represent?

 

I’m trying to use vad to trim silence from a spoken phrase like “guess what”.  Using vad to trim the front of the audio clip eliminates the first half of the word “guess” and I’m trying to figure out how to fix that.

 

Thanks…

 

Damon

 


------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and
threat landscape has changed and how IT managers can respond. Discussions
will include endpoint security, mobile security and the latest in malware
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Sox-users mailing list
Sox-users@...
https://lists.sourceforge.net/lists/listinfo/sox-users

 « Return to Thread: How does vad work?