Upload files to a Hugging Face repository
Explore recent datasets from Hugging Face Hub
Browse TheBloke models' history
Build datasets using natural language
Manage and analyze datasets with AI tools
Data annotation for Sparky
Manage and label data for machine learning projects
Create a domain-specific dataset seed
Validate JSONL format for fine-tuning
Count tokens in datasets and plot distribution
Generate synthetic datasets for AI training
Upload files to a Hugging Face repository
List of French datasets not referenced on the Hub
Dadada is a tool designed for dataset creation and management, specifically optimized for uploading files to Hugging Face repositories. It simplifies the process of organizing, storing, and sharing datasets, making it easier for researchers and developers to collaborate on machine learning projects.
• Integration with Hugging Face Hub: Directly upload datasets to Hugging Face repositories. • File Management: Organize and store datasets in a structured and accessible format. • Collaboration Support: Share datasets with team members or the public. • Customizable Metadata: Add descriptions, tags, and other metadata to datasets for better organization. • Version Control: Track changes and manage different versions of datasets.
What is the maximum file size I can upload?
The maximum file size for uploads depends on your Hugging Face Hub account limits. Free accounts typically have a 5 GB storage limit.
Can I upload multiple files at once?
Yes, Dadada supports batch uploads, allowing you to upload multiple files simultaneously.
How do I share my dataset with others?
You can share your dataset by making it public on Hugging Face Hub or by inviting collaborators via email.