Skip to content
View ysharma3501's full-sized avatar

Block or report ysharma3501

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ysharma3501/README.md

Hi there, I'm Yatharth 👋

GitHub Stats

I am an AI researcher focused on pushing the boundaries of generative audio and speech synthesis. My work centers on building high-performance, efficient architectures for TTS and neural audio restoration.

Research & Projects

I specialize in developing novel architectures for speech and rapid inference. Some of my key work includes:

  • LavaSR: A novel architecture for Bandwidth Extension (BWE) and speech restoration. It is designed to be the fastest and most flexible model in its class. (Submitted to Interspeech 2026).
  • LuxTTS: A high-quality, rapid voice cloning model reaching speeds of 150x realtime through advanced distillation techniques.
  • NovaSR: A lightning-fast audio upsampler utilizing a novel architecture for high-fidelity BWE.
  • MiraTTS: Emotionally fine-tuned Spark-TTS integrated with a custom-built upsampler for expressive, high-resolution speech.
  • LinaCodec: A highly compressive neural audio codec(compressing 60x more then previous codecs) optimized for speech models.

🤝 Collaboration

I am actively looking to collaborate on writing and publishing research papers in the deep learning and audio DSP space. If you're working on novel speech architectures or efficient transformer scaling, would be happy to connect.


GitHub | X / Twitter | LinkedIn

Pinned Loading

  1. LuxTTS LuxTTS Public

    A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.

    Python 1.8k 227

  2. NovaSR NovaSR Public

    A lightning fast audio upsampler.

    Python 739 70

  3. MiraTTS MiraTTS Public

    A high quality and fast TTS repository

    Python 506 42

  4. LavaSR LavaSR Public

    🌋LavaSR: Fast Speech restoration and enhancement

    Python 459 41

  5. LinaCodec LinaCodec Public

    A highly compressive and high-quality neural audio codec for speech models.

    Python 261 25

  6. FlashSR FlashSR Public

    Fast audio super resolution from 16khz to 48khz.

    Python 200 19