AI hype is built on high test scores. Those tests are flawed.

@faun ・ Oct 25,2023

https://www.technologyreview.com/2023/08/30/1078670/large-la...

AI hype is built on high test scores. Those tests are flawed.

Large language models like GPT-3 and GPT-4 have shown impressive capabilities, passing tests that mimic human reasoning and intelligence. However, there is little agreement on what these results actually mean and how to interpret them, leading researchers to call for more rigorous evaluation methods and an avoidance of human-based tests.

Share with your friends and followers