09 Current

09 Current

Beginning of December.

Folder: llm_code

now: most core moving parts of a RAG system
remain: improve structure + add components for best practices

Foundation

config system

Data side

ingestion process
vector index
retriever

Reasoning side - agent

LLM
tools / utils
middleware
messages

UI & observability

CLI
LangSmith tracing

Sam

more details agent.py is the "driver" that calls other scripts.

tools
- rebuild_index()
- search_docs()
middleware
- trim_history()
- max_context_chars()
CLI
- index()
- ask()

Sam

next steps

middleware: more to add
evaluation: LangSmith Datasets and Evaluation, subgraphs, guardrails
graph: add router/branching logic (currently graph is linear)
tools: add semantic routing
loading: chunk / splitter module (currently default loader)
async pipeline: currently 1 step at a time
output: want json, currently plain text
chat UI
.py files: simplify, continue refactoring to match langchain