Загрузка видео...

Не удалось загрузить видео

На главную

We introduce representative generative benchmarking—custom eval sets built from your own data that reflect real user queries. thank you for collaborating! link to report in replies

74,443 просмотров • 1 год назад •via X (Twitter)

Комментарии: 9

Фото профиля Chroma
Chroma1 год назад

link to technical report: Grounded in experiments with production data, our method captures performance differences that public benchmarks like MTEB miss.

Фото профиля RTTS
RTTS1 год назад

API testing of interfaces is critical to determine if they meet requirements for functionality, reliability, performance, and security. Check out RTTS - the automated testing experts since 1996. #API #testautomation #integrationtest

Фото профиля Sumuk
Sumuk1 год назад

@weights_biases this is super cool! at 🤗Huggingface we introduced a generative open source system but for full LLM evals instead! would be great to collab!

Фото профиля swyx
swyx1 год назад

@weights_biases you guys somehow made notebooks look good, incredible

Фото профиля Aarush Sah
Aarush Sah1 год назад

@weights_biases YES YES YES YES

Фото профиля ebaad
ebaad1 год назад

@weights_biases Jealous of that @HermanMiller chair, how can I get a job at @trychroma.

Фото профиля LA Bloke
LA Bloke1 год назад

@weights_biases Perhaps, you should use AI to reformat your message/paper?

Фото профиля Ryan
Ryan1 год назад

@weights_biases This is what I need to do manually at the moment. very curious to see what this is capable of.

Фото профиля Allan Ryan
Allan Ryan1 год назад

@weights_biases Kelly is legit

Похожие видео