Recently have been playing with HortonWorks HDP 2.2. Was starting to configure some oozie workflows and when submitting the job the first step’s Hive script failed with this error and stack. To fix this, SSH into your HDP instance VM and edit: /etc/hadoop/conf/core-site.xml and change the following config to add “localhost”. Save and restart the relevant services […]Read More Fix: HDP “Unauthorized connection for super-user: oozie from IP 127.0.0.1”
Just went through this little primer on the R language for data analysis. Pretty good and worth 30 minutes of your time. Check it out. http://tryr.codeschool.com/Read More Give R a shot
If you haven’t been exposed to Nathan Marz’s ideas on Big Data, the following links are definitely worth your time: http://manning.com/marz/ http://www.infoq.com/presentations/Complexity-Big-Data http://nathanmarz.com/speaking/Read More Ideas worth spreading
Came across this creative presentation on Redis: http://www.slideshare.net/JustinCarmony/blazing-data-with-redis-20Read More Excellent presentation on Redis
Recently I was working on implementing a custom IAuthenticator and IAuthority for Cassandra 1.1.1 because really there is not much/any security out of the box. For those of you familiar with Cassandra, its distribution used to include a simple property file based implementation of the IAuthentication and IAuthority that you could reference in your cassandra.yaml file […]Read More Astyanax -> Cassandra PoolTimeoutException during Authentication failure?
Today I pushed up some source to Github for a utility I was previously working on to load data from USPS AIS data files into HBase/Mysql using Hadoop mapreduce and simpler data loaders. Source @ https://github.com/bitsofinfo/usps-ais-data-loader This project was originally started to create a framework for loading data files from the USPS AIS suite of […]Read More USPS AIS bulk data loading with Hadoop mapreduce
I recently started playing around with Redhat’s Openshift PaaS and installed the MongoDB and RockMongo cartridges on my application. My use case was just to leverage the Openshift platform to run my MongoDB instance for me, and I really was ready (nor needing) to push an actual application out to the application running @ openshift; […]Read More How to access your OpenShift MongoDB database remotely on OS-X