Fine tuning Whisper for Speech Transcription

Multimodal RAG System: PDF + Audio Processing with LLaVA | T5 Fine-Tuning | Flask Deployment

Live CC: Learning Video LLM with Streaming Speech Transcription at Scale (Apr 2025)

AI-Powered Speech Recognition with Dialect Adaptation | Whisper + NLP Correction Demo

Multilingual Speaker ID with Whisper: A Deep Dive

Resolving the Weird Character Output in Hugging Face's Whisper Model Fine-Tuning

Fine-tuning Whisper on ATC Data | 5.8% Word Error Rate (WER) + Demo

Fine-tuning شرح مبسط | AI Fine-Tuning Simple Tutorial

Take a Shot!: Natural Language Control of Robotic X-ray Systems for Image-guided Surgery

Fine-tuning Whisper to learn my mother tongue ODIA || PART-4

Word Error Rate in Automatic Speech Recognition || Evaluation of Whisper || PART-2

Master Fine-Tuning OpenAI Whisper with PyTorch for Custom ASR Tasks || PART-1

Multi modal Audio + Text Fine tuning and Inference with Qwen

ICNLSP 2024: Thonburian Whisper: Robust Fine-tuned and Distilled Whisper for Thai

Fine tune and Serve Faster Whisper Turbo

(AI Tinkerers Ottawa) Fine tuning Whisper with PEFT LORA w/ Rishab Bahal

Text to Speech Fine-tuning Tutorial

Fine Tuning Whisper for Automatic Speech Recognition | Punjabi Language | Transcribe | PEFT

Reproducing Whisper-style Training Using an open-source toolkit and publicly available Data | Kurian

Google's Universal Speech Model for 100+ languages beats OpenAI's Whisper Model

[Demo] Empowering Indian Languages’ Subtitling with LLMs | Indic Subtitler | The Fifth Elephant
