3 docs tagged with "llmops"

Debugging LLM Apps

Production troubleshooting for LLM features - classify the failure, inspect prompts retrieval tools and logs, and fix the right layer without guessing.

How to test non-deterministic LLM systems with datasets, scorers, and LLM-as-judge; eval-driven development and harness engineering; and the LLMOps discipline of operating prompts, models, and agents in production.

Tooling and Frameworks

A map of the AI application tooling landscape - -orchestration frameworks, connectivity protocols, vector databases, evaluation and observability, and the LLMOps discipline that ties them together.

Debugging LLM Apps

Evaluation & LLMOps

Tooling and Frameworks