← back to work2025–
prototype
Veloci
Multilingual AI news — 7 Indian languages
Next.jsMultilingual LLMpgvectorTypeScript
─── 01
Seven languages, not one
A multilingual AI news surface covering Telugu, Tamil, Kannada, Malayalam, Marathi, Hindi, Bengali. Each language gets its own retrieval, summarization, and rendering path. Shared orchestration only.
Status: prototype. Reader UI shipped; ingestion + summarization pipeline runs locally, not yet productionized.
─── 02
Stack (current + planned)
- Reader — Next.js, TypeScript, Tailwind.
- Summarization — a multilingual LLM with per- language prompt templates.
- Embeddings — multilingual-e5 for cross- publisher dedup where it makes sense.
- Backend (planned) — worker service polling publishers, writing to Postgres + pgvector.
─── 03
What’s hard
- Per-language evaluation data isn’t off-the- shelf. Building a small hand-labeled set per language is the bottleneck.
- Cross-publisher dedup is noisy. Same story, different wording. Source-citation per claim is the discipline that keeps it honest.
- Editorial voice differs per region. Translating-and-flattening produces translated-English. Per- language prompt templates preserve cadence.