Awesome DataFlow 🚀

January 20, 2026

A curated list of awesome projects, research works, and applications built with or on top of DataFlow, an LLM-driven framework for data preparation, synthesis, evaluation, and workflow automation in the era of Data-Centric AI.


📦 Open-source Projects

Full repositories that directly use DataFlow as a core dependency.

  • Project Name
    Short description of the project.
    e.g., Uses DataFlow pipelines to curate high-quality SFT and RLHF data.

How to contribute

  1. Fork this repository
  2. Add your project to the appropriate section
  3. Keep descriptions concise (1–2 lines)
  4. Submit a Pull Request

Contribution guidelines

  • The project must use DataFlow (core framework or ecosystem)
  • Open-source repositories are preferred
  • Avoid promotional or closed-source-only entries
  • Keep descriptions objective and factual

📜 License

This list is released under the CC0-1.0 License (public domain dedication).