I don’t care much about mathematical tasks, and code intelligence is only a minor preference. What I care about most is overall comprehension and intelligence (for RAG and large-context handling). In any case, I'm looking for any benchmark that covers a wide variety of models and is kept up to date.

  • thickertoofan@lemm.ee (OP)
    22 days ago

    I checked out almost all of 'em from the list, but 1B models are generally unusable for RAG.