09 Current
Beginning of December.
Folder: llm_code
- now: most core moving parts of a RAG system are in place
- remaining: improve structure + add components for best practices
Foundation
- config system
Data side
- ingestion process
- vector index
- retriever
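The three data-side parts listed above can be pictured with a toy, standard-library-only sketch. It is not the code in llm_code: the function names `ingest`, `build_index`, and `retrieve` are hypothetical, and plain term-count vectors stand in for whatever embedding model the real vector index uses.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def ingest(raw_docs: dict[str, str]) -> list[dict]:
    """Ingestion: turn raw text into records with an id and a token list."""
    return [
        {"id": doc_id, "text": text, "tokens": tokenize(text)}
        for doc_id, text in raw_docs.items()
    ]


def build_index(records: list[dict]) -> list[tuple[dict, Counter]]:
    """Vector index: one sparse term-count vector per record (assumption:
    a real system would store dense embeddings instead)."""
    return [(rec, Counter(rec["tokens"])) for rec in records]


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(index: list[tuple[dict, Counter]], query: str, k: int = 2) -> list[str]:
    """Retriever: return the ids of the k records most similar to the query."""
    q = Counter(tokenize(query))
    ranked = sorted(index, key=lambda item: cosine(q, item[1]), reverse=True)
    return [rec["id"] for rec, _ in ranked[:k]]
```

Calling `retrieve(build_index(ingest(docs)), "some question")` runs the three stages in order, which is the same shape as ingestion, vector index, retriever.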
Reasoning side - agent
- LLM
- tools / utils
- middleware
- messages
UI & observability
- CLI
- LangSmith tracing
Sam
more details
agent.py is the "driver" that calls other scripts.
- tools
  - rebuild_index()
  - search_docs()
- trim_history()
- max_context_chars()
- CLI
  - index()
  - ask()
next steps
- middleware: more to add
- evaluation: LangSmith Datasets and Evaluation, subgraphs, guardrails
- graph: add router/branching logic (graph is currently linear)
- tools: add semantic routing
- loading: chunk / splitter module (currently the default loader)
- async pipeline: currently runs one step at a time
- output: want JSON; currently plain text
- .py files: simplify, continue refactoring to match LangChain
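Two of the items above, router/branching logic and JSON output, can be sketched together in plain Python. This does not use the project's graph framework. The keyword list, the branch names, and the shape of the JSON payload are hypothetical, and keyword matching stands in for the semantic routing the list mentions.

```python
import json

# Hypothetical keywords suggesting the question needs document retrieval.
RETRIEVAL_HINTS = ("doc", "file", "index", "where", "which")


def route(question: str) -> str:
    """Pick a branch by substring match on the lowercased question."""
    text = question.lower()
    return "retrieve" if any(hint in text for hint in RETRIEVAL_HINTS) else "direct"


def answer_direct(question: str) -> dict:
    """Placeholder branch: answer without consulting the index."""
    return {"answer": f"direct answer to: {question}", "sources": []}


def answer_with_retrieval(question: str) -> dict:
    """Placeholder branch: a real version would call the retriever first."""
    return {"answer": f"retrieved answer to: {question}", "sources": ["placeholder.md"]}


BRANCHES = {"retrieve": answer_with_retrieval, "direct": answer_direct}


def run(question: str) -> str:
    """Route the question, run the chosen branch, and return a JSON string."""
    branch = route(question)
    result = BRANCHES[branch](question)
    result["route"] = branch
    return json.dumps(result, sort_keys=True)
```

Returning a JSON string with a fixed set of keys (`answer`, `sources`, `route`) means a caller can parse the result with `json.loads` rather than scraping plain text.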