Quick and Easy Capacity Planning with Pandas

The discipline of Operations Research (OR) applies analytical methods from math, statistics, economics, and computer science to help leaders make good decisions. Enterprise software consumes physical resources (CPU, Memory, Disk, and Bandwidth)...

Data Exploration with Data Viz Cheat Sheet

Today I collect and organize useful data visualization (Data Viz) tools that aid data exploration. I illustrate the use of the tools via the classic Abalone database, hosted on the University of California, Irvine (UCI) Machine Learning...

Refactor Matlab to Tidyverse

I've previously discussed the Reduced Coulomb Energy Neural Net algorithm on this site. I wrote the algorithm in Matlab, which uses index-based logic to select, filter, wrangle, and process data. Today I will refactor the Matlab code to Tidyverse....

FastAI x Flask - Mods vs. Rockers!

Fastai provides helper functions on top of PyTorch to help us wrangle, clean, and process data. In this HOWTO we will accomplish the following: deploy an AWS g3.8xlarge instance; compile and install NVIDIA drivers on our g3.8xlarge instance; use a...

Big Data Idol: The Math

Caution! Math Ahead! For the Math-phobic, I explain how I crunch the test results in a math-free, simple and focused blog post here. I use math here, so this may be your last chance to escape! Still with me? Excellent! The bullets below outline...

Big Data Idol: How I Crunched the Numbers

Do you have big data chops? Quick, what do these three things have in common? Yankees, Giants, Rangers, Knicks. What about these? Beatles, Monkees, Beach Boys. Do you have an answer for each? "New York," for example, for the first list and "Rock...

Let us now praise ugly code!

In this blog post I will revisit the first piece of code I wrote in the R programming language, back in the early part of this decade. Coming from an Octave/MATLAB background, I really enjoyed the functional nature of R. I imagined flinging...

Why A "Big Data" Personality Test?

Why do we need yet another personality test? Because without "big data" technologies, online "personality tests" suffer these problems: with most tests, we quickly see a pattern to the answers, and can easily steer the test to the outcome we want...
