Welcome to Headshots-Next-TypeScript-Project Discussions! #1
-
👋 Welcome!We’re using Discussions as a place to connect with other members of our community. We hope that you:
To get started, comment below with an introduction of yourself and tell us about what you do with this community. |
Beta Was this translation helpful? Give feedback.
Replies: 5 comments
-
|
I’m building a production-grade Retrieval-Augmented Generation (RAG) system using LangChain and a vector database (e.g. Pinecone or Chroma) to power an internal knowledge assistant. The system requirements are:
I’m trying to decide on best practices for:
What architecture patterns or practical approaches have worked well for you when running RAG systems in production? Any code examples, diagrams, or real-world lessons learned would be greatly appreciated. Thanks! |
Beta Was this translation helpful? Give feedback.
-
1. Document Chunking StrategyRecommended defaults
Guidelines
This allows selective re-embedding when documents change. 2. Vector Store DesignPinecone (managed)
Chroma (self-hosted)
Indexing strategy
3. RAG Architecture PatternRecommended separation of concerns Key principles
4. Caching Strategy
Common tools:
5. Observability & Hallucination ControlBest practices
Metrics to track
6. Cost Control
7. Security & Enterprise Concerns
8. Example (LangChain – simplified)retriever = vectorstore.as_retriever(
search_type="similarity",
search_kwargs={"k": 5}
)
qa_chain = RetrievalQA.from_chain_type(
llm=llm,
retriever=retriever,
return_source_documents=True
)SummaryA production RAG system should:
|
Beta Was this translation helpful? Give feedback.
-
|
ahh. very good solutioin. thank you very much. |
Beta Was this translation helpful? Give feedback.
-
|
okay good. |
Beta Was this translation helpful? Give feedback.
-
|
Good repository! |
Beta Was this translation helpful? Give feedback.
I’m building a production-grade Retrieval-Augmented Generation (RAG) system using LangChain and a vector database (e.g. Pinecone or Chroma) to power an internal knowledge assistant.
The system requirements are:
I’m trying to decide on best practices for:
Chunking strategy
Vector store design