
Respan
@RespanAI • 1,716 subscribers
Self-driving observability and evals for agents. Formerly Keywords AI (YC W24). Community: https://t.co/JAZ9auu0hr
Videos

Respan Launch Week, Day 2: Evals Evals are one of the hardest parts of building AI applications. It is not because teams cannot run them. It is because they are hard to structure, hard to compare, and even harder to improve over time. So teams end up guessing. Did this prompt actually get better? Is this model really an improvement? We built Evals in Respan to make this systematic. You define: - what you want to test, such as prompts, models, or configs - the dataset, from production logs or test cases - the evaluators, whether LLM, code, human, or a mix Then you run experiments and compare results side by side. Same inputs. Same evaluation. Clear answers. No more guessing. Start running your first eval.
Respan127,602 Aufrufe • vor 2 Monaten

We worked with OpenAI to build a native integration for the OpenAI Agents SDK. Today, we are very excited to launch as a tracing processor. With just a few lines of code, you can trace all your agent workflows and debug them much faster. Check out this quick demo to get started:
Respan58,055 Aufrufe • vor 1 Jahr
Keine weiteren Inhalte verfügbar