Mark Sullivan / Fast Company:
ARC Prize Foundation unveils ARC-AGI-3, an AI benchmark with simple video-game-like scenarios designed to measure on-the-fly reasoning rather than memory recall. ARC-AGI-3 tests whether models can reason through novel problems, not just recall patterns, a task that even top systems still struggle with.
from Techmeme https://ift.tt/iwKaleo