Case Study

Category:
Speech & Healthcare AI
Impact:
6 months | $425k Seed Funding
Healthcare professionals often spend significant time documenting patient visits, leading to increased administrative burden and reduced focus on patient care. For Arabic-speaking regions—particularly in dialect-rich contexts like Saudi Arabia existing speech recognition systems fail to capture medical terminology and dialectal nuances accurately. Sahl.ai required a custom ASR (Automatic Speech Recognition) system to convert spoken medical consultations into precise, structured clinical notes.
Built end-to-end training and evaluation pipelines for ASR models. Conducted 200+ experiments across datasets to refine accuracy and robustness.
Applied LoRA and QLoRA fine-tuning, optimizing large ASR models for medical speech.
Generated synthetic speech data to expand training resources. Developed self-supervised ASR methods, doubling productivity in data annotation and collection.
Supervised teams for speech dataset collection and medical annotation. Designed audio cleaning pipelines to improve transcription quality.
Delivered a functional MVP, enabling the startup to validate the product with investors.