sexta-feira, 6 de abril de 2012

[EN] - Big Data Helper - Part 2 - Getting Started

After the Concepts post


Now it is time to really get our hands ready to actually DO something. Well what do you need?
There are (at least) two choices:
1 - setup your own environment from scratch
2 - get a pre-built environment to start playing around in minutes.


Plan #1
- go to http://hadoop.apache.org/ and download all the necessary binaries, install an operating system (not in this order) and configure all the necessary layers.


Plan #2
- go to https://ccp.cloudera.com/display/SUPPORT/Downloads and download the Cloudera CDH virtual machine files and just use VMWare Player to get started (for example)
- go to http://www.mapr.com/download and download the MapR M3 virtual machine files to get started
- go to <insert other link here> (I don't know any other companies supplying such resources).


After this you are ready to go.
What can be done with these VMs, will be discussed in later posts. 


For the rest of these series of posts, I will be using the Cloudera CDH3u3 distribution.


Until then.
Thank you.




-- ====================

Other Tutorial Links



http://pinelasgarden.blogspot.pt/2012/04/en-big-data-helper-part-1-concepts.html
http://pinelasgarden.blogspot.pt/2012/05/en-big-data-helper-part-3-loading-data.html
http://pinelasgarden.blogspot.pt/2012/05/en-big-data-helper-part-4-pig.html
http://pinelasgarden.blogspot.pt/2012/05/en-big-data-helper-part-5-mapreduce.html

Sem comentários:

Enviar um comentário