Skip to content

Evaluation framework for MLLMs on the Odd-One-Out task. Benchmarking spatial reasoning, relational logic, zero-shot anomaly detection in complex multi-object scenes.

Notifications You must be signed in to change notification settings

AgneseRe/Benchmarking-Open-MLLMs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

About

Evaluation framework for MLLMs on the Odd-One-Out task. Benchmarking spatial reasoning, relational logic, zero-shot anomaly detection in complex multi-object scenes.

Topics

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •