SomeAI.org
  • Hot AI Tools
  • New AI Tools
  • AI Category
SomeAI.org
Home
Dataset Creation
🧬

Synthetic Data Generator

Build datasets using natural language

473
🐶

Convert to Safetensors

Convert and PR models to Safetensors

238
📖

TxT360: Trillion Extracted Text

Create a large, deduplicated dataset for LLM pre-training

106
📄

PDF to Dataset

Convert PDFs to a dataset and upload to Hugging Face

88
🤗

Datasets Tagging

Create and validate structured metadata for datasets

82
🚀

Research Tracker

74
🔎

Semantic Hugging Face Hub Search

Search and find similar datasets

66
👁

Datasets Convertor

Support by Parquet, CSV, Jsonl, XLS

56
🌍

Space to Dataset Saver

Save user inputs to datasets on Hugging Face

31
💻

Domain Specific Seed

Create a domain-specific dataset project

23
📊

Reddit Dataset Creator

Create Reddit dataset

19
⏰

SmolVLM2 IPhone Waitlist

sign in to receive news on the iPhone app

17
🚀

gradio_huggingfacehub_search V0.0.7

Search for Hugging Face Hub models

15
✍

Dataset ReWriter

ReWrite datasets with a text instruction

13
⚗

Distilabel Synthetic Data Pipeline Finder

Find and view synthetic data pipelines on Hugging Face

12
🔀

Open LLM Leaderboard Renamer

Rename models in dataset leaderboard

12
🦀

Recent Hugging Face Datasets

Explore recent datasets from Hugging Face Hub

11
⚗

Distilabel Dataset Generator

Create datasets with FAQs and SFT prompts

10
💻

Collection Dataset Explorer

Browse and view Hugging Face datasets

9
📈

Trending Repos

Display trending datasets from Hugging Face

9
SomeAI.org

Discover 10,000+ free AI tools instantly. No login required.

About

  • Blog

© 2025 • SomeAI.org All rights reserved.

  • Privacy Policy
  • Terms of Service