Introduction
I’ve spent 15+ years building software that makes life easier, from creating web and mobile apps to designing the back‑end systems that power them. My years of development work have taught me to build quickly, learn from mistakes, and embrace change. Now I’m bringing that same energy to AI and machine learning, exploring how language models, speech technologies and automation can improve user experiences and simplify complex workflows. I’m committed to blending my full‑stack skills with these emerging AI tools to build useful, human‑centric products.
Recent activities related to language learning and AI integration.
Local LLM with Ollama
I experimented with running various models locally using Ollama, testing function calling and local integration.
Related blog posts: #function-calling
TTS(Text-to-Speech) for a Japanese Documentary Video
I worked on a project to convert a PowerPoint slideshow file into a fully narrated Japanese documentary video, The skills involved are:
- generate voiceover audio file from a text script using the Google TTS API.
- use LLM to automate SSML annotation to control the pronunciation of Japanese words, especially for names and places.
- use iMovie to mix the static slide images with the audio narration.
STT (Speech-to-Text) for Webinar Transcription
I worked on a project to transcribe spoken Cantonese language into text, The skills involved are:
- use whisper.cpp to convert audio recordings into timestamped text transcripts.
- fix incorrect words in the transcripts manually.
The video is about Teaching Cantonese at Home by Dr. Chaak Ming Lau: https://www.youtube.com/watch?v=TW6UiDrejVI
Learn more about Dr. Chaak Ming Lau at chaak.net
Cantonese language preservation
I am a native Cantonese speaker and I am passionate about preserving the Cantonese language and culture.
In 2024, I started volunteering at Familogue.ca, a local non-profit organization dedicated to supporting Cantonese parenting and promoting the use of the language. My responsibilities include building websites and automation, creating educational materials, and leading storytelling sessions for children.
I am also keen on using technology to aid language learning and preservation efforts. Some future projects include developing automated story generation tools for children, and exploring the use of AI in language education.
MCP (Model Context Protocol)
Experimenting with Model Context Protocol (MCP) to improve the productivity of LLMs in various scenarios, including accounting data analysis and document summarization.
Related blog posts: #mcp