Skip to content

Add dataset processing script to align clean and noisy speech via STFT#28

Open
aryanKaga wants to merge 1 commit intomicrosoft:masterfrom
aryanKaga:master
Open

Add dataset processing script to align clean and noisy speech via STFT#28
aryanKaga wants to merge 1 commit intomicrosoft:masterfrom
aryanKaga:master

Conversation

@aryanKaga
Copy link

Summary

This pull request adds a script that processes clean and noisy .wav files in the MS-SNSD dataset, extracts their STFT magnitude spectrograms, aligns them by time, and stores them in .npz format for efficient machine learning training.

Features

  • Matches noisy and clean speech files using clnsp ID extraction
  • Applies convert_to_magnitude() from stft.py
  • Crops clean/noisy STFT matrices to the shortest length

Let me know if you'd prefer a different format or integration method. Happy to update the structure

@aryanKaga
Copy link
Author

@microsoft-github-policy-service agree

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant