Large Language Models (LLMs) may not be as smart as they seem, according to a study from Apple researchers.
LLMs from OpenAI, Google, Meta, and others have been touted for their impressive reasoning skills. But research suggests their purported intelligence may be closer to “sophisticated pattern matching” than “true logical reasoning.” Yep, even OpenAI’s o1 advanced reasoning model.
The most common benchmark for reasoning skills is a test called GSM8K, but since it’s so popular, there’s a…
Read the full article here