Colvert

In my free time, I’m working on a toy project named: Colvert. It’s allowed me to test some ideas and play with technology I’m interested (Python, DuckDB, HTMX). But more importantly, it’s software I’m using for my personal needs.

It’s fast UX that allows exploring large CSV/Parquet files using SQL. It’s refreshed as you type and get a graphic with one click. It’s much faster than a spreadsheet and as a developer I feel SQL more comfortable.

Their is a toy LLM integration for text to SQL. It’s domain I want to explore more this year.

Read more...

Oreille

A small week end project: Oreille is a wrapper on OpenAPI Whisper API. It provides support for long audio files.

OpenAPI Whisper support only files that are less than 25 MB. Oreille will break the audio file into chunks of 25 MB’s or less. https://platform.openai.com/docs/guides/speech-to-text/longer-inputs

Oreille will also compute the correct timing of the subtitle when merging the output of Whisper. So once you export the subtitle the timestamp of the subtitle will be right.

You can open and save WAV files with pure python. For opening and saving non-wav files – like mp3 – you’ll need ffmpeg or libav.

View project on Github