Dataset generation and transformation
-
Infinite Dataset Hub
♾281Search and save datasets generated with a LLM in real time
-
Fake Data Generator (JSONL)
🎰65Generate synthetic dataset files (JSON Lines)
-
Common Crawl Pipeline Creator
🕸22Create and customize a data processing pipeline for Common Crawl data
-
Dataset Spreadsheets
🤗15Edit Parquet datasets on Hugging Face