Upload files to a Hugging Face repository
Display translation benchmark results from NTREX dataset
ReWrite datasets with a text instruction
Manage and label data for machine learning projects
Create and manage AI datasets for training models
Display trending datasets from Hugging Face
Create a large, deduplicated dataset for LLM pre-training
Display trending datasets and spaces
Access NLPre-PL dataset and pre-trained models
Create a report in BoAmps format
Browse a list of machine learning datasets
Manage and orchestrate AI workflows and datasets
Search and find similar datasets
Dadada is a tool designed for dataset creation and management, specifically optimized for uploading files to Hugging Face repositories. It simplifies the process of organizing, storing, and sharing datasets, making it easier for researchers and developers to collaborate on machine learning projects.
• Integration with Hugging Face Hub: Directly upload datasets to Hugging Face repositories. • File Management: Organize and store datasets in a structured and accessible format. • Collaboration Support: Share datasets with team members or the public. • Customizable Metadata: Add descriptions, tags, and other metadata to datasets for better organization. • Version Control: Track changes and manage different versions of datasets.
What is the maximum file size I can upload?
The maximum file size for uploads depends on your Hugging Face Hub account limits. Free accounts typically have a 5 GB storage limit.
Can I upload multiple files at once?
Yes, Dadada supports batch uploads, allowing you to upload multiple files simultaneously.
How do I share my dataset with others?
You can share your dataset by making it public on Hugging Face Hub or by inviting collaborators via email.