Migrate datasets from GitHub to Hugging Face Hub
Supports Parquet, CSV, JSONL, XLS
List of French datasets not referenced on the Hub
Organize and process datasets efficiently
Display instructional dataset
Browse and view Hugging Face datasets
Upload files to a Hugging Face repository
Save user inputs to datasets on Hugging Face
Display translation benchmark results from NTREX dataset
Rename models in dataset leaderboard
Explore and edit JSON datasets
Explore datasets on a Nomic Atlas map
ReWrite datasets with a text instruction
The Github To Huggingface Dataset Migration Tool transfers datasets from GitHub repositories to the Hugging Face Hub. It streamlines the migration and checks that datasets arrive intact. It is particularly useful for data scientists and researchers who need to share, collaborate on, or manage datasets across platforms.
• GitHub Repository Support: Migrate datasets directly from GitHub repositories, including common formats such as CSV, JSON, and text files.
• Hugging Face Integration: Uploads datasets to the Hugging Face Hub and formats them for it.
• Data Validation: Automatically checks dataset integrity and consistency during the migration.
• Progress Tracking: Real-time progress monitoring keeps users informed about the migration status.
• Customization Options: Configure settings such as dataset name, description, and privacy level on the Hugging Face Hub.
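The overall flow behind these features can be sketched with the public `huggingface_hub` client. This is a minimal sketch under stated assumptions, not the tool's actual implementation: it assumes `git` and the `huggingface_hub` package are installed and that a Hugging Face token is already configured. The function names `github_clone_url` and `migrate` are hypothetical.

```python
import subprocess
import tempfile


def github_clone_url(owner: str, repo: str) -> str:
    """Build the HTTPS clone URL for a public GitHub repository."""
    return f"https://github.com/{owner}/{repo}.git"


def migrate(owner: str, repo: str, hf_repo_id: str, private: bool = True) -> None:
    """Clone a GitHub repo and upload its files to a Hub dataset repo (sketch)."""
    # Imported lazily so the helper above works without the package installed.
    from huggingface_hub import HfApi

    api = HfApi()  # picks up a locally stored token or the HF_TOKEN variable
    with tempfile.TemporaryDirectory() as workdir:
        # Shallow clone: only the latest snapshot is needed for a migration.
        subprocess.run(
            ["git", "clone", "--depth", "1", github_clone_url(owner, repo), workdir],
            check=True,
        )
        # Create the target dataset repo; exist_ok avoids an error on reruns.
        api.create_repo(hf_repo_id, repo_type="dataset", private=private, exist_ok=True)
        api.upload_folder(
            folder_path=workdir,
            repo_id=hf_repo_id,
            repo_type="dataset",
            ignore_patterns=[".git/**"],  # skip Git metadata
        )
```

The `private` flag corresponds to the privacy setting mentioned above; the dataset name is carried by `hf_repo_id`.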
What datasets can I migrate using this tool?
The tool supports a wide range of dataset formats commonly found in GitHub repositories, including CSV, JSON, and text files.
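To illustrate how files in these formats might be selected from a cloned repository, here is a small sketch. The suffix set is an assumption drawn from the formats named in the answer above, and `find_dataset_files` is a hypothetical helper; the tool's real selection logic may differ.

```python
from pathlib import Path

# Assumed set, based on the formats named above; the tool's real list may differ.
SUPPORTED_SUFFIXES = {".csv", ".json", ".txt"}


def find_dataset_files(root: str) -> list[Path]:
    """Return files under root whose extension is in SUPPORTED_SUFFIXES."""
    return sorted(
        p
        for p in Path(root).rglob("*")
        if p.is_file() and p.suffix.lower() in SUPPORTED_SUFFIXES
    )
```

Matching is case-insensitive, so `data.CSV` and `data.csv` are both picked up.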
Is my data safe during migration?
Yes, the migration process uses secure authentication methods to protect your data. Make sure you have permission to access the source repository and to publish the dataset on the Hugging Face Hub.
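One common way to keep credentials out of source code is to read the access token from an environment variable. The sketch below assumes the token is stored in `HF_TOKEN`, the variable the `huggingface_hub` client reads; `read_hub_token` is a hypothetical helper, and this is not necessarily how the tool itself handles authentication.

```python
import os


def read_hub_token(env_var: str = "HF_TOKEN") -> str:
    """Return the token stored in env_var, or raise if it is missing or empty."""
    token = os.environ.get(env_var, "").strip()
    if not token:
        raise RuntimeError(f"Set {env_var} to a Hugging Face access token first.")
    return token
```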
Can I migrate large datasets?
Yes, the tool supports large datasets. However, you may need to check the storage limits on Hugging Face Hub for your account type.
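Before migrating, it can help to measure the dataset locally and compare it with whatever limit applies to your account. The sketch below is an illustration only: `total_size_bytes` and `fits_within` are hypothetical helpers, and the limit is passed in by the caller because the actual figure depends on your account type.

```python
from pathlib import Path


def total_size_bytes(root: str) -> int:
    """Sum the sizes of all regular files under root."""
    return sum(p.stat().st_size for p in Path(root).rglob("*") if p.is_file())


def fits_within(root: str, limit_bytes: int) -> bool:
    """Check whether the files under root fit in a caller-supplied limit."""
    return total_size_bytes(root) <= limit_bytes
```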
How long does the migration take?
Migration time depends on the size of the dataset and your internet connection. The tool provides real-time progress updates to help you track the process.
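A rough lower bound on transfer time follows from dataset size and upload bandwidth. The sketch below is a back-of-the-envelope estimate, not something the tool reports: `estimate_upload_seconds` is a hypothetical helper, and it ignores protocol overhead, server-side processing, and the time needed to download from GitHub.

```python
def estimate_upload_seconds(size_bytes: int, upload_mbps: float) -> float:
    """Estimate transfer time from size in bytes and bandwidth in megabits/s."""
    if upload_mbps <= 0:
        raise ValueError("upload_mbps must be positive")
    bits = size_bytes * 8  # bandwidth is quoted in bits, sizes in bytes
    return bits / (upload_mbps * 1_000_000)
```

For instance, 1 GB (1,000,000,000 bytes) over a 50 Mbit/s uplink comes to 160 seconds before any overhead.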
Can I pause or resume the migration?
Currently, the tool does not support pausing or resuming migrations. For large datasets, ensure a stable internet connection and uninterrupted execution.