Anthropic unveils BioMysteryBench to test Claude's bioinformatics skills against human experts, and says Mythos solved ~30% of 23 questions that stumped experts (Anthropic)

Anthropic:
Anthropic unveils BioMysteryBench to test Claude's bioinformatics skills against human experts, and says Mythos solved ~30% of 23 questions that stumped experts  —  In this post, Brianna, a researcher on the discovery team, shares results from a recent bioinformatics benchmarking effort.



from Techmeme https://ift.tt/VrC6Ffo

Post a Comment

0 Comments