Add dataset processing script to align clean and noisy speech via STFT by aryanKaga · Pull Request #28 · microsoft/MS-SNSD

aryanKaga · 2025-07-23T07:24:10Z

Summary

This pull request adds a script that processes clean and noisy .wav files in the MS-SNSD dataset, extracts their STFT magnitude spectrograms, aligns them by time, and stores them in .npz format for efficient machine learning training.

Features

Matches noisy and clean speech files using clnsp ID extraction
Applies convert_to_magnitude() from stft.py
Crops clean/noisy STFT matrices to the shortest length

Let me know if you'd prefer a different format or integration method. Happy to update the structure

aryanKaga · 2025-07-23T07:33:15Z

@microsoft-github-policy-service agree

Add files via upload

e761323

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add dataset processing script to align clean and noisy speech via STFT#28

Add dataset processing script to align clean and noisy speech via STFT#28
aryanKaga wants to merge 1 commit intomicrosoft:masterfrom
aryanKaga:master

aryanKaga commented Jul 23, 2025

Uh oh!

aryanKaga commented Jul 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

aryanKaga commented Jul 23, 2025

Summary

Features

Uh oh!

aryanKaga commented Jul 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant