software

Secure and scalable speech transcription for local and HPC

Post

A production-ready local transcription workflow leveraging OpenAI's Whisper models that addresses the limitations of cloud-based solutions through complete data sovereignty, unlimited scale, reproducible processing and advanced quality control, while maintaining GDPR compliance.

Secure and scalable speech transcription for local and HPC

Publication

A production-ready, local transcription workflow using OpenAI's Whisper, designed for security, scalability on HPC, and advanced quality control. It overcomes the privacy and reproducibility limitations of cloud-based services, offering a robust alternative for academic and enterprise use.

WebVTT caption transcription app

Application / dashboard

Open-source R-based application that converts video captions from WebVTT format into plain text by automatically removing timestamps and formatting the content into accessible documents.