View and submit LLM benchmark evaluations
Evaluate LLM over-refusal rates with OR-Bench
Find recent high-liked Hugging Face models
Leaderboard of information retrieval models in French
Display and submit LLM benchmarks
Multilingual Text Embedding Model Pruner
Compare audio representation models using benchmark results
Browse and filter ML model leaderboard data
View and submit machine learning model evaluations
Evaluate adversarial robustness using generative models
Evaluate code generation with diverse feedback types
Teach, test, evaluate language models with MTEB Arena
Quantize a model for faster inference
Aiera Finance Leaderboard is a model benchmarking tool designed to provide insights into the performance of large language models (LLMs) within the financial domain. It enables users to view and submit evaluations of various LLMs, fostering transparency and community-driven insights into AI performance in finance.
What is the purpose of Aiera Finance Leaderboard?
Aiera Finance Leaderboard is designed to help users understand and compare the performance of different LLMs in financial contexts, enabling better decision-making for AI adoption.
Can anyone submit evaluations to the leaderboard?
Yes, the platform allows users to submit their own evaluations, contributing to a community-driven benchmarking process.
How often are the rankings updated?
The rankings are updated in real-time as new evaluations are submitted, ensuring the most current and accurate representation of LLM performance.