View NSQL Scores for Models
Generate and view leaderboard for LLM evaluations
Calculate GPU requirements for running LLMs
Benchmark AI models by comparison
Teach, test, evaluate language models with MTEB Arena
Pergel: A Unified Benchmark for Evaluating Turkish LLMs
Measure execution times of BERT models using WebGPU and WASM
Analyze model errors with interactive pages
Predict customer churn based on input details
Display benchmark results
Display model benchmark results
Browse and submit LLM evaluations
Benchmark LLMs in accuracy and translation across languages
DuckDB NSQL Leaderboard is a tool designed to track and compare the performance of different models using the NSQL (Normalized SQL) benchmarking framework. It provides a centralized platform to view and analyze NSQL scores, enabling users to evaluate and compare model performance efficiently.
What is NSQL in DuckDB?
NSQL (Normalized SQL) is a benchmarking framework used to evaluate the performance of SQL query engines. It provides a standardized way to measure and compare query execution times across different systems.
How do I interpret the NSQL scores?
Higher NSQL scores generally indicate better performance. Scores are calculated based on the execution time of a suite of SQL queries, with faster execution times resulting in higher scores.
Can I customize the leaderboard view?
Yes, you can customize the leaderboard by filtering, sorting, and selecting specific models to compare. This allows you to focus on the models and metrics that are most relevant to your needs.