I wrote a book! It's called Starting with Spark. You should read it.

all | popular | tags | rss

Installing Apache Spark 1.3.1 on HDP 2.2.4.2

In this section we will configure Spark 1.3.1 on Hortonworks Sandbox with HDP 2.2.

Continue Reading »

spark, hadoop Comments

Configuring Hortonworks Sandbox on Azure

For folks attending the workshop at Hadoop Summit, San Jose 2015 we provided Microsoft Azure Pass. If you already have an Azure account skip this step. If you are...

Continue Reading »

azure, hadoop Comments

Covering the HBase

Apache HBase was initially developed by Powerset, a natural language search engine startup in 2006. Then in 2008 they contributed the code base to the Apache Soft...

Continue Reading »

hbase, hadoop Comments

How to build a Hadoop VM with Ambari and Vagrant

In this post, we will explore how to quickly and easily spin up our own VM with Vagrant and Apache Ambari. Vagrant is very popular with developers as it lets one ...

Continue Reading »

vagrant, hadoop Comments

Deploying a Hadoop cluster on EC2

In this post, we’ll walk through the process of deploying an Apache Hadoop 2 cluster on the EC2 cloud service offered by Amazon Web Services (AWS), using Hortonwo...

Continue Reading »

ec2, hadoop Comments

Data dependency concerns in parallel computing

The most severe bottlenecks in high performance systems in majority cases results from I/O operations. To buffer I/O or other slow accesses, engineers devised cac...

Continue Reading »

64 bit Steamroller on our way

Over the weekend I installed and configured a new build [v. 1421] of the Windows XP x64 Edition on my Compaq Presario 3000 Laptop with a 64-bit processor (just in...

Continue Reading »

64-bit, windows Comments
« Newer Posts Page 3 of 3