Idea: Blockchain for a better environment

I’ve always been fascinated about doing something for the environment. Recently I had an idea about using blockchain to implement a system that can potentially reduce air pollution and increase people’s heal...     Read more »

Project Euler - Problem 2

Link: https://projecteuler.net/problem=2 Problem details: Each new term in the Fibonacci sequence is generated by adding the previous two terms. By starting with 1 and 2, the first 10 terms will be: 1, 2,...     Read more »

Project Euler - Problem 1

Link: https://projecteuler.net/problem=1 Problem details: If we list all the natural numbers below 10 that are multiples of 3 or 5, we get 3, 5, 6 and 9. The sum of these multiples is 23. Find the sum of ...     Read more »

Salary survey & blindspots - A statistics perspective

There was an article on VNExpress (one of the largest Vietnam’s news website) about IT industry’s salary survey in 2016. It was shared by many friends of mine and quickly became the topic for debates and dis...     Read more »

Return multiple values from a C function

Basically C allows you to return only a single value from a function, but if you know pointer, everything’s changed. A small trick with this tool with let you return as many values as you want from a C funct...     Read more »

Introducing vnnews package

I’ve had to crawl the web to collect a lot of data for studying machine learning recently. Writing the same code again and again to perform the same task made me decided to write a separate Python package fo...     Read more »

Movie Review - Sentimental Analysis using Bag of Words

This post is my practical code that follow the Kaggle tutorial about Bag of Words model for Natural Language Processing. Please refer to the following link to read the notebook. Click here to view the noteb...     Read more »

World Population Analysis

In this post, we will crawl world population data from Wikipedia and do some analysis. The data is taken from this page provides us the population detail for every country on Earth every 5 years, from 1955 u...     Read more »

Vietnam IT Jobs Analysis

Introduction I’ve been looking for a Data Analysis job recently and failed to do so. Actually I got several offers for Python Developer position but the work in these companies are quite boring so I left. T...     Read more »

Using Fingerprint Reader on Ubuntu

Today I saw a colleague using Fingerprint reader to login to his Ubuntu 14.04. I was also impressed when he did some installation using apt-get or pip install, the system asked him to swipe his finger across...     Read more »

Monty Hall problem

This is a very famous probability puzzle. It’s based on the American television game show Let’s Make Deal and not surprisingly the name Monty Hall was the game show’s host name. Following is a variation of t...     Read more »

Travelling Salesman Problem

Introduction In the previous post, I introduced you the concepts of NP problem. Travelling Salesman Problem (or TSP) is one of them. I’ll take its definition from Wikipedia: The travelling salesman prob...     Read more »

Scraping Vietnamworks job

Introduction This document shows some basic web scraping steps to crawl Vietnamworks and get 50 newest jobs. Below are some libraries used in this doc: BeautifulSoup 4: worker for all scraping activi...     Read more »

NP Problems

Recently I was asked to research something called NP problem. NP stands for Non-deterministic Polynomial time. In simple term, it’s a category of algorithms that we can not calculate its run time (in regular...     Read more »

Using KMeans to cluster 1D data

Problem: Given the following set: { 2, 4, 10, 12, 3, 20, 30, 11, 25 }. Write pseudo code for k-means clustering algorithm to cluster the above set to 2 clusters. Then implement code to achieve the same. ...     Read more »