Distributed Data Processing Frameworks
- MapReduce Framework
- Research Papers
- Apache Hadoop
- Projects in Hadoop Family
- Hive
- Pig
- HBase
- Column-Oriented Database
- Apache Sqoop
- Data transfer between Hadoop and relational databases (RDBMS)
- ZooKeeper
- Impala
- Massively parallel processing (MPP) SQL query engine for Hadoop
- Apache Accumulo
- Sorted, distributed key/value store; based on Google's BigTable.
- Bulk Synchronous Parallel Model
- Pregel
- Bulk synchronous parallel computation for processing graphs
- Research Paper
- Apache Giraph
- Stratosphere
- Research Paper
- Spark
- Spark Streaming, MLlib, GraphX, Shark
- Storm
- Distributed and fault-tolerant real-time computation
- Percolator
- Dremel
- Research Paper