bigdata.md

July 15, 2021 · View on GitHub

Bookmarks tagged [bigdata]

www.codever.land/bookmarks/t/bigdata

awesome-bigdata

https://github.com/onurakpolat/awesome-bigdata#readme

A curated list of awesome big data frameworks, ressources and other awesomeness. - onurakpolat/awesome-bigdata


awesome-public-datasets

https://github.com/awesomedata/awesome-public-datasets#readme

A topic-centric list of HQ open datasets. PR ☛☛☛. Contribute to awesomedata/awesome-public-datasets development by creating an account on GitHub.


awesome-hadoop

https://github.com/youngwookim/awesome-hadoop#readme

A curated list of amazingly awesome Hadoop and Hadoop ecosystem resources - youngwookim/awesome-hadoop


awesome-data-engineering

https://github.com/igorbarinov/awesome-data-engineering#readme

A curated list of data engineering tools for software developers - igorbarinov/awesome-data-engineering


awesome-streaming

https://github.com/manuzhang/awesome-streaming#readme

a curated list of awesome streaming frameworks, applications, etc - manuzhang/awesome-streaming


awesome-spark

https://github.com/awesome-spark/awesome-spark#readme

A curated list of awesome Apache Spark packages and resources. - awesome-spark/awesome-spark


The Essential Guide to Machine Data

https://www.splunk.com/pdfs/ebooks/the-essential-guide-to-machine-data.pdf

Whatever you call it, machine data is one of the most underused and undervalued assets of any organization. And, unfortunately, it’s usually kept for some minimum amount of time before being tossed ou...


面向程序员的数据挖掘指南

http://dataminingguide.books.yourtion.com


数据挖掘中经典的算法实现和详细的注释

https://github.com/linyiqun/DataMiningAlgorithm


大数据/数据挖掘/推荐系统/机器学习相关资源

https://github.com/Flowerowl/Big-Data-Resources


大型集群上的快速和通用数据处理架构

https://code.csdn.net/CODE_Translation/spark_matei_phd


Spark 编程指南简体中文版

https://aiyanbo.gitbooks.io/spark-programming-guide-zh-cn/content/