I recently gave a talk at USF: University of San Francisco (USF) – SLS
The slides are available on slideshare.
I recently gave a talk at USF: University of San Francisco (USF) – SLS
The slides are available on slideshare.
http://www.jarvana.com/jarvana/view/org/apache/mahout/mahout-core/0.3/mahout-core-0.3-javadoc.jar!/org/apache/mahout/classifier/bayes/datastore/HBaseBayesDatastore.html
https://cwiki.apache.org/confluence/display/MAHOUT/Clustering+of+synthetic+control+data
http://wiki.apache.org/hadoop/EclipseEnvironment
https://sites.google.com/site/hadoopandhive/home/how-to-write-output-to-multiple-named-files-in-hadoop-using-multipletextoutputformat
Beyond Hadoop: Next-Generation Big Data Architectures: http://gigaom.com/cloud/beyond-hadoop-next-generation-big-data-architectures/
A Distributed Systems Reading List: http://www.dancres.org/reading_list.html
Cloud Computing: http://gigaom.com/cloud/
NoSQL at Twitter: http://www.slideshare.net/squarecog/nosql-at-twitterdevoxx2010
Cloudera: http://www.cloudera.com/resource/hw10_hadoop_based_intelligent_text_processing_system
Tutorial: https://cwiki.apache.org/confluence/display/Hive/Tutorial
Joins: https://cwiki.apache.org/confluence/display/Hive/Tutorial#Tutorial-Joins
Getting Started: https://cwiki.apache.org/confluence/display/Hive/GettingStarted#GettingStarted-RunningHive
Hive Client: https://cwiki.apache.org/Hive/hiveclient.html
Hive JDBC: https://cwiki.apache.org/Hive/hivejdbcinterface.html
I had problems starting 0.7.1, and had to download and use 0.6.0 with Apache Derby.
http://search.cpan.org/~gariev/Google-ProtocolBuffers-0.08/lib/Google/ProtocolBuffers.pm#MessageClass-%3Eencode%28$hashref%29
http://wiki.apache.org/hadoop/Hbase/MapReduce
http://lucene.apache.org/solr/tutorial.html