In this post, I'll walk through scikit-learn's DecisionTreeClassifier, from loading the data to fitting the model and making predictions.
We need to predict the class label of the last record ...
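As a rough sketch of that load, fit and predict flow, something like the following could work. The excerpt does not show the post's own data, so the iris dataset bundled with scikit-learn is used here purely as a stand-in, and `random_state=0` is an arbitrary choice to keep the run repeatable.

```python
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier

# Assumption: iris stands in for the post's data set, which the excerpt does not show.
X, y = load_iris(return_X_y=True)

# Hold out the last record and train on everything before it.
X_train, y_train = X[:-1], y[:-1]
last_record = X[-1:]  # slicing keeps the 2-D shape (1, n_features) that predict() expects

# random_state is fixed only so that repeated runs build the same tree.
clf = DecisionTreeClassifier(random_state=0)
clf.fit(X_train, y_train)

# Predict the class label of the held-out last record.
predicted = clf.predict(last_record)
print("predicted class:", predicted[0], "actual class:", y[-1])
```

Holding out a single record only demonstrates the API calls; it says nothing about how well the tree generalizes.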
Apache Hadoop is a framework that allows distributed processing of large data sets across clusters of computers using MapReduce.
The steps listed below build and package Hadoop from source code. This guide assumes a fresh installation of Ubuntu 14.04.
While writing a Pig script, we usually use PigStorage to load a CSV file.
Consider a sample CSV file in the following format.
2,Loading successfull,2014-09-25
3,Loading successfull,2014-09-25
4,Loading successfull,2014-09-25
This file can be loaded as
logs = LOAD 'log_folder/log_file.csv' USING ...
I had a NumPy array of numbers that I needed to split wherever the value changes.
For example, consider an array as shown below.
values = [112.0, 111.0, 113.0, 111.0, 112.0, 112.0, 112.0, 113.0, 113.0, 113.0, 114.0, 114 ...
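One way to do this kind of split, as a minimal sketch, is to combine `np.diff` with `np.split`. The array below reuses only the first ten values from the excerpt, since the full array is cut off, and `split_on_change` is a hypothetical helper name, not something taken from the post.

```python
import numpy as np

# Illustrative sample: the first ten values shown in the excerpt (the full array is truncated).
values = np.array([112.0, 111.0, 113.0, 111.0, 112.0,
                   112.0, 112.0, 113.0, 113.0, 113.0])

def split_on_change(arr):
    """Split a 1-D array into runs of equal consecutive values."""
    # np.diff is non-zero wherever an element differs from its predecessor;
    # adding 1 turns each of those positions into the start index of a new run.
    boundaries = np.where(np.diff(arr) != 0)[0] + 1
    return np.split(arr, boundaries)

runs = split_on_change(values)
for run in runs:
    print(run)
```

The exact comparison with zero is fine for values that repeat exactly, as they do here; for noisy floating-point data a tolerance such as `np.abs(np.diff(arr)) > eps` would probably be a safer test.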
Data from PostgreSQL tables can be imported into HDFS using Sqoop with the following steps.
Make sure the PostgreSQL JDBC connector is available in
To list all available tables in the PostgreSQL database:
$ sqoop list-tables \
    --connect jdbc:postgresql://hostname.com/databaseName \
    --username myUserName \
    --password myPassword
Made the #pyconindia2014 funnel ...
Categorizing an array into different buckets in d3js with d3.nest.
For example, consider a dataset that contains an array of integers ranging from 1 to 200. Suppose we have to group this array into the following four buckets:
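The grouping idea itself can be sketched outside d3. The version below is in Python rather than d3.nest, and the four bucket ranges (1-50, 51-100, 101-150, 151-200) are hypothetical, since the excerpt does not show the actual definitions. `bucket_label` and `group_into_buckets` are illustrative names as well.

```python
from bisect import bisect_left
from collections import defaultdict

# Hypothetical inclusive upper bounds for the four buckets; the post's real ranges are not shown.
UPPER_BOUNDS = [50, 100, 150, 200]
LABELS = ["1-50", "51-100", "101-150", "151-200"]

def bucket_label(value):
    """Return the label of the bucket containing value (assumes 1 <= value <= 200)."""
    # bisect_left gives the index of the first upper bound that is >= value.
    return LABELS[bisect_left(UPPER_BOUNDS, value)]

def group_into_buckets(values):
    """Group values by bucket label, roughly what a key function does in d3.nest."""
    groups = defaultdict(list)
    for v in values:
        groups[bucket_label(v)].append(v)
    return dict(groups)

data = list(range(1, 201))
buckets = group_into_buckets(data)
for label in LABELS:
    print(label, len(buckets[label]))
```

The key function is the only part that encodes the bucket definitions, so swapping in different ranges means changing just the two lists at the top.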
Trying to connect to a remote PostgreSQL server with pgAdmin leads to an "Unable to connect" error.
To enable pgAdmin to connect, edit the following configuration in the server:
$ sudo vim /etc/postgresql/9.3/main/postgresql.conf
listen_addresses = '*'
$ sudo vim /etc/postgresql/9.3/main/pg_hba.conf
local all postgresql trust ...
Multi-node cluster installation guide
Though setting up a single-node cluster from this guide is fairly straightforward, I'm documenting the few so-called obvious deviations here.
The operating system I chose to install ...
I was getting the exception below on my Java client, which was trying to establish a URL connection to a server.
javax.net.ssl.SSLProtocolException: handshake alert: unrecognized_name
The Java client was actually trying to fetch an XML document from the URL and store it locally in a file as ...