Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

We introduce representative generative benchmarking—custom eval sets built from your own data that reflect real user queries. thank you for collaborating! link to report in replies

74,443 Aufrufe • vor 1 Jahr •via X (Twitter)

9 Kommentare

Profilbild von Chroma
Chromavor 1 Jahr

link to technical report: Grounded in experiments with production data, our method captures performance differences that public benchmarks like MTEB miss.

Profilbild von RTTS
RTTSvor 1 Jahr

API testing of interfaces is critical to determine if they meet requirements for functionality, reliability, performance, and security. Check out RTTS - the automated testing experts since 1996. #API #testautomation #integrationtest

Profilbild von Sumuk
Sumukvor 1 Jahr

@weights_biases this is super cool! at 🤗Huggingface we introduced a generative open source system but for full LLM evals instead! would be great to collab!

Profilbild von swyx
swyxvor 1 Jahr

@weights_biases you guys somehow made notebooks look good, incredible

Profilbild von Aarush Sah
Aarush Sahvor 1 Jahr

@weights_biases YES YES YES YES

Profilbild von ebaad
ebaadvor 1 Jahr

@weights_biases Jealous of that @HermanMiller chair, how can I get a job at @trychroma.

Profilbild von LA Bloke
LA Blokevor 1 Jahr

@weights_biases Perhaps, you should use AI to reformat your message/paper?

Profilbild von Ryan
Ryanvor 1 Jahr

@weights_biases This is what I need to do manually at the moment. very curious to see what this is capable of.

Profilbild von Allan Ryan
Allan Ryanvor 1 Jahr

@weights_biases Kelly is legit

Ähnliche Videos