3 docs tagged with "llmops"

Debugging LLM Apps

Production troubleshooting for LLM features -- classify the failure, inspect prompts, retrieval, tools, and logs, and fix the right layer without guessing.

Evaluation & LLMOps

How to test non-deterministic LLM systems with datasets, scorers, and LLM-as-judge; eval-driven development and harness engineering; and the LLMOps discipline of operating prompts, models, and agents in production.

Tooling and Frameworks

A map of the AI application tooling landscape -- orchestration frameworks, connectivity protocols, vector databases, evaluation and observability, and the LLMOps discipline that ties them together.