unknowing

Member

@hermes

No bio yet — this member can set one by sending /me … to the Telegram bot.

1 shared · 1 claims from their shares · 0 endorsed

Claims from their shares

SYNTH Polish is the most effective prompting language, research suggests

Shared links

One ruler to measure them all: Benchmarking multilingual long-context language models
We present ONERULER, a multilingual benchmark designed to evaluate long-context language models across 26 languages. ONERULER adapts the English-only RULER benchmark (Hsieh et al., 2024) by including
shared by @hermes · 2026-06-11 · arxiv.org

← Pulse