Voice Activity Detection #5

nayyarv · 2014-11-05T05:20:46Z

In a typical voice sample, a significant portion of the recording contains no speech activity. Typical speech applications therefore remove these inactive regions; leaving them in means extracting the MFCCs of silence, which aren't very helpful in most situations.

I've modified your code slightly to allow for Voice Activity Detection. Its default behaviour is still intact, but anyone who wishes to implement a Voice Activity Detector function now has a template, documentation and a simple threshold to play with, as well as an example showing simple applications.

The code passes both the frames and the entire signal to the detector, which should be flexible enough for anyone to write their own version depending on their purpose. I considered using the frame power provided as the first MFCC, but decided that passing the frames and signal was more flexible overall, and it allows each frame to be compared against the entire signal at once.
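For readers who want a feel for the idea, here is a minimal sketch of a threshold-based detector that receives both the frames and the whole signal. It is not the code from this PR: `frame_signal`, `simple_threshold_vad` and the `ratio` parameter are hypothetical names chosen for illustration, and the rule (frame power compared against a fraction of the overall signal power) is just one simple assumption about what such a threshold could look like.

```python
import numpy as np


def frame_signal(signal, frame_len, frame_step):
    """Split a 1-D signal into overlapping frames (no windowing, no padding)."""
    num_frames = 1 + (len(signal) - frame_len) // frame_step
    # Row i holds the sample indices of frame i.
    idx = np.arange(frame_len)[None, :] + frame_step * np.arange(num_frames)[:, None]
    return signal[idx]


def simple_threshold_vad(frames, signal, ratio=0.5):
    """Return a boolean mask that is True for frames judged to contain activity.

    Hypothetical rule: a frame counts as active when its mean power exceeds
    `ratio` times the mean power of the entire signal.
    """
    frame_power = np.mean(frames.astype(float) ** 2, axis=1)
    signal_power = np.mean(signal.astype(float) ** 2)
    return frame_power > ratio * signal_power


# Synthetic input: 1 s of near-silent noise followed by 1 s of a 220 Hz tone.
rate = 8000
rng = np.random.default_rng(0)
quiet = 0.001 * rng.standard_normal(rate)
t = np.arange(rate) / rate
loud = 0.5 * np.sin(2 * np.pi * 220 * t)
signal = np.concatenate([quiet, loud])

# 25 ms frames with a 10 ms step at 8 kHz.
frames = frame_signal(signal, frame_len=200, frame_step=80)
mask = simple_threshold_vad(frames, signal)

# Keep only the frames flagged as active before any feature extraction.
active_frames = frames[mask]
```

The same mask could equally be applied to the rows of an already computed feature matrix, provided it was built from the same framing, so that the features of silent frames are dropped.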

This is a modification I made for my thesis in which I used your code to extract the MFCCs from a bunch of files, and I thought other people may find this handy too.

meresmclr · 2016-07-15T04:44:23Z

@nayyarv very nice -- could you fix the conflicts with existing code for ease of merging/evaluating your changes?

nayyarv · 2016-07-26T07:19:54Z

Haha, it's been some time since I opened this PR!

I'll have a look later this week to see whether it's a straightforward update, whether it's py3 compatible, and whether it's not too complicated.

nayyarv added 5 commits November 5, 2014 15:43

Added VoiceActivityDetection to speechFeatures

33f3923

Some reformatting

ff5dff2

Rolling back reformatting to minimize differences in files

98a8463

Rolling back reformatting to minimize differences in files again

26761b6

Redoing all the changes from scratch again

385af77

SuperGops7 approved these changes May 29, 2019
