datatrove
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
3,073stars
Forks
268
Open issues
87
Watchers
3,073
Size
34.9 MB
PythonApache License 2.0
Created: Jun 14, 2023
Updated: May 29, 2026
Last push: May 26, 2026