Indexing
- Content-Addressable Storage: Deduplication, Indexing, Integrity, and +1 more
Principles of content-addressable storage (CAS) and Merkle trees, focusing on cryptographic content hashing, sharding layouts, deduplication, and block-level verification.
- Retrieval & RAG: Data-Pipelines, Embeddings, Indexing, and +2 more
Operational principles for robust retrieval and RAG pipelines, focusing on hybrid lexical-semantic retrieval, long-term embedding model stability, automated ranking evaluation, and privacy-aware indexing.
- Search & Retrieval Engine: Indexing, Machine-Learning, Monitoring, and +2 more
A high-performance search and retrieval engine architecture for large document and media collections, designed for low-latency ranking and horizontally scalable inverted indexing.
- Proximity Service for Maps: Caching, Databases, Geospatial, and +1 more
A system design for discovering nearby points of interest with low latency, focusing on spatial indexing via geohashing and quadtrees, backed by read-heavy caching tiers.
- Grit: Extensibility, Indexing, Search-Algorithms, and +1 more
A from-scratch Git implementation in Rust, exploring content-addressable storage, plumbing/porcelain layering, and high-performance object caching.
- Ragchain: Embeddings, Indexing, Machine-Learning, and +2 more
A local RAG stack (ChromaDB + Ollama) for private, reproducible retrieval and LLM inference, focusing on hybrid retrieval strategies and index versioning.