Running kdb+ on Google Cloud (GCP)

Note: You can now subscribe to my blog updates here to receive latest updates. Update: This post is now available as a video tutorial on our YouTube channel. A lot has changed in the kdb+ world recently and as the world moves to the cloud, it was only a matter of time that kdb+ did as well. …

Kaggle: Solving Titanic challenge using logistic regression in Spark

In this post, I will show how to tackle Kaggle’s entry level challenge called Titanic. In this challenge, you are given training and test dataset. Your goal is to use the training dataset to build and train a model and then use it to predict whether a passenger will survive or not listed in test …

Subscribing to a message broker in q/kdb through embedPy

Update: As of May 2020, there is now an official Solace interface for kdb+ which can be used, instead of embedPy, to interact with Solace’s PubSub+ broker. You have undoubtedly heard of messaging platforms such as kafka, solace and rabbitMQ that allow applications to communicate with each other.  Messaging platforms have existed for several years …

Book Review: Head First Python by Paul Berry

I have never had any official training in Python and I am sure there are many other developers out there like me who came from a different language and easily picked up Python on their own. A very intuitive way of learning a new language is by using them to solve real world problems one …

Installing kdb+, jupyterq and embedPy using conda!

Note: You can now subscribe to my blog updates here to receive latest updates. Kx announced during their Kx25 event last week that you can now download and install kdb+, embedPy and jupyterq via conda. For those who don’t know, conda is a platform and language agnostic tool for installing packages and managing environments. It’s mainly used …

Kx celebrates 25 years of kdb

Note: You can now subscribe to my blog updates here to receive latest updates. Last Friday, on May 18th, Kx celebrated 25 years of kdb with a full day conference at New York Academy of Sciences in downtown Manhattan, NY. You can find the agenda here. In the last year or so, kdb+ has become increasingly ‘open’ …

My (awesome) experience at PyCon 2018 in Cleveland!

It’s been almost 3 years that I have been programming in Python. I picked up Python as a useful tool to solve some critical problems at work and quickly started appreciating all that it had to offer. I used different mediums to learn python. I sought guidance from other senior python developers. I read blog …

Top 5 metrics for evaluating regression models

In my previous posts, I have covered some regression models (simple linear regression, polynomial regression) and classification models (k-nearest neighbors, support vector machines). However, I haven’t really discussed in-depth different ways to evaluate these models. Without proper metrics, not only can you not claim the accuracy of your models confidently but you also cannot compare …

Implementing Support Vector Machine (SVM) algorithm in python

As you have probably noticed by now, there are several machine learning algorithms available at your disposal. In my previous post, I covered a very popular classification algorithm called K-Nearest Neighbors. In today’s post, I will cover another very common and powerful classification algorithm called Support Vector Machine (SVM). What is SVM and how does …

Implementing k-nearest neighbors in python

Last time, we looked into one of the simplest classification algorithms in machine learning called binomial logistic regression. In this post, I am going to cover another common classification algorithm called K Nearest Neighbors, otherwise known as KNN. To recap, we have mostly discussed regression models such as simple and multivariate linear regression and polynomial …