Building Sidi - charmayne

Sidi is a Chinese reading tool for serious Chinese learners looking to build real fluency by recognizing patterns in the structure of Chinese characters.

Why I’m building this

Cover

16 Feb 2026

back after a bit of a break, during which i procrastinated by building two small tools: a library for all of my markdown artifacts and a reading tracker. making mini tools just for yourself is the most useless and delightful thing.

here’s how i’m phasing out the MVP development (with claude code):

Phase 0: data import and infrastructure
Phase 1: core lookup (search and detail pages - no auth)
Phase 2: generation (LLM pipeline)
Phase 3: auth and save to deck
Phase 4: further design, testing, deploy

current progress: ✅ phase 0 - 3 complete

12 Jan 2026

MVP has two flows:

Lookup: Search → result → detail page
Save to deck: Requires login, lets you build a personal deck

There are 5 “levels” of entities that need different treatment.

Compound word (e.g., 电脑): Multi-character word
Composite character (e.g., 明): Built from multiple components
Simple character (e.g., 日): Single semantic unit
Primitive (e.g., 氵): Modifies meaning, rarely stands alone
“Derived” primitive: Variant form of another component

Each level needs a different design treatment. L1 shows word-level breakdown, L2 shows character components, and L3–L5 show etymology and usage.

What I’m working on next:

Finalize detail page design for each entity level
Build search + result flow
Design the AI prompt for generating breakdowns
Figure out save/deck system (login required)

Previously

Before Jan 9: The manual approach

I started with the idea of building my own dataset, but it seemed high friction and not very sustainable. Building a comprehensive, accurate dataset would take a lot of effort.

Landed on a scoped v0. I would curate 500–700 characters, Simplified Chinese only, 1-2 core user flows, and see how it goes

I also decided on web app over iOS for now to have the lowest possible barrier to iterating / shipping without dealing with app store reviews.

Jan 9: wait, i can just use AI

Realised that instead of curating a static dataset, I could use AI for the character breakdowns. This shifts the entire approach. I can support the full character set from day one and ship much faster.

My thinking on ICP has shifted. Heritage learners were my initial focus, but the opportunity is broader: serious, intermediate learners frustrated with retention and comprehension. The pain point is universal.

I looked at the competitive landscape more closely and did some product teardowns:

HanziHero uses English mnemonics, a different approach that relies on memory tricks rather than structural understanding.
Hanly is the closest thing to what I'm building. It has component breakdowns, but they're baked into a structured HSK 0–6 course. It's completely beginner-focused, which means it's not useful if you're already at an intermediate level.

Notes for later: I should consider adding stroke order, and spaced repetition flashcards for feature parity, but it's not critical for v0

Considered going mobile again because demand validation is much stronger now, but chose to stick with web to ship faster