Display a treemap of languages and datasets
Calculate VRAM requirements for running large language models
Profile a dataset and publish the report on Hugging Face
Analyze and visualize data with various statistical methods
Need to analyze data? Let a Llama-3.1 agent do it for you!
Visualize amino acid changes in protein sequences interactively
Gather data from websites
Browse LLM benchmark results in various categories
Generate synthetic dataset files (JSON Lines)
Search for tagged characters in Animagine datasets
Generate a detailed dataset report
Display server status information
VLMEvalKit Evaluation Results Collection
Corpus Map is a data visualization tool designed to help users explore and analyze datasets through an interactive treemap representation. It allows for the visualization of languages and datasets in a hierarchical and organized manner, making it easier to understand complex data structures at a glance.
What datasets are compatible with Corpus Map?
Corpus Map supports a wide range of dataset formats, including CSV, JSON, and Excel files. For best results, ensure your dataset is properly structured and categorized.
Can I customize the colors and styles of the treemap?
Yes, Corpus Map allows users to customize colors, fonts, and other visual elements to suit their preferences or brand guidelines.
How do I interpret the size and color of the nodes?
The size of each node typically represents the relative size or importance of the dataset or language, while the color can be customized to represent different categories or metrics. Refer to the legend for exact interpretations.