-

Enterprise Document Intelligence [Vol.1 #M1] – The thesis behind every architectural choice in this series
20 min read -

Enterprise Document Intelligence [Vol.1 #7C] – One LLM call ranks the candidates with reasons. The…
31 min read -

Finding the right anchors for RAG: keyword, embedding, and TOC signals in parallel
Large Language ModelsEnterprise Document Intelligence [Vol.1 #7B] – Retrieval is filtering on structured tables: keywords first, TOC…
33 min read -

Enterprise Document Intelligence [Vol.1 #7A] – Stop searching strings. Filter line_df and toc_df. Pick anchors…
21 min read -

Enterprise Document Intelligence [Vol.1 #6bis] – Ask one focused clarification, learn the default from the…
11 min read -

Dispatching the Parsed RAG Question: Chunk Strategy, Model Tier, Activations, Audit
Large Language ModelsEnterprise Document Intelligence [Vol.1 #6c] – The decisions the parser makes on top of the…
28 min read -

Five fields RAG should extract from any question: keywords, scope, shape, decomposition, clarification
Large Language ModelsEnterprise Document Intelligence [Vol.1 #6b] – The five field families the parser reads straight from…
31 min read -

Enterprise Document Intelligence [Vol.1 #6a] – Why a user question deserves the same parsing as…
13 min read -

Enterprise Document Intelligence [Vol.1 #4] – A diagnostic across PDFs and questions, and a map…
23 min read -

Enterprise Document Intelligence [Vol.1 #3] – Why the ML toolkit (hyperparameter sweeps, train/test splits, explainability…
30 min read