Anthropic:
Anthropic unveils BioMysteryBench to test Claude's bioinformatics skills against human experts, and says Mythos solved ~30% of 23 questions that stumped experts — In this post, Brianna, a researcher on the discovery team, shares results from a recent bioinformatics benchmarking effort.
from Techmeme https://ift.tt/VrC6Ffo