Organize and process datasets for AI models
Create a large, deduplicated dataset for LLM pre-training
Access NLPre-PL dataset and pre-trained models
Count tokens in datasets and plot distribution
Explore and manage datasets for machine learning
Organize and process datasets using AI
Organize and process datasets using AI
Label data for machine learning models
Browse a list of machine learning datasets
Upload files to a Hugging Face repository
Data annotation for Sparky
Browse and extract data from Hugging Face datasets
Explore recent datasets from Hugging Face Hub
g is a powerful tool designed for dataset creation and management. It simplifies the process of organizing and processing datasets, making it easier to prepare data for AI models. Whether you're working with raw data or refining existing datasets, g provides intuitive features to streamline your workflow.
What file formats does g support?
g supports a wide range of formats, including CSV, JSON, Excel, and more, making it versatile for different data sources.
Can I collaborate with others in real time?
Yes, g offers real-time collaboration features, allowing teams to work together on datasets seamlessly.
Where is my data stored?
Your data is stored locally by default, but you can choose to save it to cloud storage services for easier access and collaboration.