Loading video...

Video Failed to Load

Go Home

We introduce representative generative benchmarking—custom eval sets built from your own data that reflect real user queries. thank you for collaborating! link to report in replies

74,443 views • 1 year ago •via X (Twitter)

9 Comments

Chroma's profile picture
Chroma1 year ago

link to technical report: Grounded in experiments with production data, our method captures performance differences that public benchmarks like MTEB miss.

RTTS's profile picture
RTTS1 year ago

API testing of interfaces is critical to determine if they meet requirements for functionality, reliability, performance, and security. Check out RTTS - the automated testing experts since 1996. #API #testautomation #integrationtest

Sumuk's profile picture
Sumuk1 year ago

@weights_biases this is super cool! at 🤗Huggingface we introduced a generative open source system but for full LLM evals instead! would be great to collab!

swyx's profile picture
swyx1 year ago

@weights_biases you guys somehow made notebooks look good, incredible

Aarush Sah's profile picture
Aarush Sah1 year ago

@weights_biases YES YES YES YES

ebaad's profile picture
ebaad1 year ago

@weights_biases Jealous of that @HermanMiller chair, how can I get a job at @trychroma.

LA Bloke's profile picture
LA Bloke1 year ago

@weights_biases Perhaps, you should use AI to reformat your message/paper?

Ryan's profile picture
Ryan1 year ago

@weights_biases This is what I need to do manually at the moment. very curious to see what this is capable of.

Allan Ryan's profile picture
Allan Ryan1 year ago

@weights_biases Kelly is legit

Related Videos