Display a treemap of languages and datasets
Analyze and visualize Hugging Face model download stats
Analyze autism data and generate detailed reports
Try the Hugging Face API through the playground
Multilingual metrics for the LMSys Arena Leaderboard
View and compare pass@k metrics for AI models
Generate detailed data profile reports
Explore token probability distributions with sliders
Generate synthetic dataset files (JSON Lines)
Analyze weekly and daily trader performance in Olas Predict
Explore income data with an interactive visualization tool
Profile a dataset and publish the report on Hugging Face
Transfer GitHub repositories to Hugging Face Spaces
Corpus Map is a data visualization tool designed to help users explore and analyze datasets through an interactive treemap representation. It allows for the visualization of languages and datasets in a hierarchical and organized manner, making it easier to understand complex data structures at a glance.
What datasets are compatible with Corpus Map?
Corpus Map supports a wide range of dataset formats, including CSV, JSON, and Excel files. For best results, ensure your dataset is properly structured and categorized.
Can I customize the colors and styles of the treemap?
Yes, Corpus Map allows users to customize colors, fonts, and other visual elements to suit their preferences or brand guidelines.
How do I interpret the size and color of the nodes?
The size of each node typically represents the relative size or importance of the dataset or language, while the color can be customized to represent different categories or metrics. Refer to the legend for exact interpretations.