View and submit LLM benchmark evaluations
Launch web-based model application
Optimize and train foundation models using IBM's FMS
Explore GenAI model efficiency on ML.ENERGY leaderboard
Evaluate LLM over-refusal rates with OR-Bench
Leaderboard of information retrieval models in French
Merge machine learning models using a YAML configuration file
Browse and submit evaluations for CaselawQA benchmarks
Quantize a model for faster inference
Calculate GPU requirements for running LLMs
Download a TriplaneGaussian model checkpoint
Pergel: A Unified Benchmark for Evaluating Turkish LLMs
Track, rank and evaluate open LLMs and chatbots
Aiera Finance Leaderboard is a model benchmarking tool designed to provide insights into the performance of large language models (LLMs) within the financial domain. It enables users to view and submit evaluations of various LLMs, fostering transparency and community-driven insights into AI performance in finance.
What is the purpose of Aiera Finance Leaderboard?
Aiera Finance Leaderboard is designed to help users understand and compare the performance of different LLMs in financial contexts, enabling better decision-making for AI adoption.
Can anyone submit evaluations to the leaderboard?
Yes, the platform allows users to submit their own evaluations, contributing to a community-driven benchmarking process.
How often are the rankings updated?
The rankings are updated in real-time as new evaluations are submitted, ensuring the most current and accurate representation of LLM performance.