Loading video...
Video Failed to Load
DSPy is great at classification tasks, but it's hard to make a formal eval metric for 'fuzzy' creative tasks like writing a blog post or telling a joke. Solution: train a judge with DSPy to agree with you 90%+, then use that judge as the eval metric for the... show more
18,654 views • 9 months ago •via X (Twitter)
0 Comments
No comments available
Comments from the original post will appear here
