bigdata.md
July 15, 2021
Bookmarks tagged [bigdata]
www.codever.land/bookmarks/t/bigdata
awesome-bigdata
https://github.com/onurakpolat/awesome-bigdata#readme
A curated list of awesome big data frameworks, resources and other awesomeness. - onurakpolat/awesome-bigdata
- tags: awesome-list, bigdata
- :octocat: source code
awesome-public-datasets
https://github.com/awesomedata/awesome-public-datasets#readme
A topic-centric list of high-quality open datasets.
- tags: awesome-list, bigdata, datasets
- :octocat: source code
awesome-hadoop
https://github.com/youngwookim/awesome-hadoop#readme
A curated list of amazingly awesome Hadoop and Hadoop ecosystem resources - youngwookim/awesome-hadoop
- tags: awesome-list, bigdata, hadoop
- :octocat: source code
awesome-data-engineering
https://github.com/igorbarinov/awesome-data-engineering#readme
A curated list of data engineering tools for software developers - igorbarinov/awesome-data-engineering
- tags: awesome-list, bigdata, data-engineering
- :octocat: source code
awesome-streaming
https://github.com/manuzhang/awesome-streaming#readme
A curated list of awesome streaming frameworks, applications, etc. - manuzhang/awesome-streaming
- tags: awesome-list, bigdata, streaming
- :octocat: source code
awesome-spark
https://github.com/awesome-spark/awesome-spark#readme
A curated list of awesome Apache Spark packages and resources. - awesome-spark/awesome-spark
- tags: awesome-list, bigdata, apache-spark
- :octocat: source code
The Essential Guide to Machine Data
https://www.splunk.com/pdfs/ebooks/the-essential-guide-to-machine-data.pdf
Whatever you call it, machine data is one of the most underused and undervalued assets of any organization. And, unfortunately, it’s usually kept for some minimum amount of time before being tossed ou...
A Programmer's Guide to Data Mining
http://dataminingguide.books.yourtion.com
Implementations of classic data mining algorithms, with detailed comments
https://github.com/linyiqun/DataMiningAlgorithm
Resources on big data, data mining, recommender systems, and machine learning
https://github.com/Flowerowl/Big-Data-Resources
An Architecture for Fast and General Data Processing on Large Clusters
https://code.csdn.net/CODE_Translation/spark_matei_phd
Spark Programming Guide (Simplified Chinese edition)
https://aiyanbo.gitbooks.io/spark-programming-guide-zh-cn/content/