In this multi-part series, we will explore how to get started with tensorflow. This tensorflow tutorial will lay a solid foundation to this popular tool that everyone seems to be talking about. The second part is a tensorflow tutorial on getting started, installing and building a small use case.
This post is the second part of the multi-part series on a complete tensorflow tutorial –
- Part 1: Introduction
- Part 2: Getting Started
- Part 3: Building the first model
Tensorflow Tutorial : Goals
- Installing Tensorflow
- Verifying Tensorflow Installion
- Before Building your first tensorflow model
Installing Tensorflow
If you have tensorflow already installed, you can just skip to the next section.
1. Operating System
Different operating systems have different means to install tensorflow. You can go through the documentation for more details. I will just be discussing what is essential to get started.
- Click the Install tab
- The following guides explain how to install a version of tensorflow that enables you to write applications in Python.
Try to follow the most basic structures with the best practices already mentioned in the documentation.
2. GPU
Installing Tensorflow with GPU requires you to have NVIDIA GPU. AMD video cards are not supported with tensorflow. NVIDIA uses low level GPU computing system called CUDA. It is an NVIDIA proprietary software.
One can go the OpenCL way with AMD but as of now it won’t work with tensorflow.
Also, all NVIDIA devices are not supported. Here is a list from the NVIDIA documentation listing the supported GPUs.
3. Environment
There are three environments you can leverage to setup tensorflow –
- Direct – This is just a typical way of installing any software library. It directly interacts with the installed operating system and installs tensorflow like any other library.
pip
is the preferred way of direct installation - Virtual environment – To all who are unaware of the advantages of a python virtual environment, a
virtualenv
is like a parallel python installation apart from the default python that is already installed in our system. Installing libraries in a virtual environment keeps the libraries separate and you will never have a clash of compatibility with the other libs that are installed directly. If anything goes wrong, you can just fire up a new virtual environment and start afresh - Docker container – This is another way of installing tensorflow. Think of a typical VMware image of a guest operating system to be a docker container on steroids. Docker can be used to start containers that have different environments at the operating system, thereby allowing you to have an environment completely separate to what you have on your host machine. All details are clearly labelled out in the tensorflow documentation
4. Python Version
The 2.7 version from Python 2 and 3.3 or later from Python 3 are supported. This holds true for all operating systems.
As of now, Windows only supports version 3.5. Python 2 with windows is a combination that is not supported
Verifying the Installation
Once tensorflow is installed, irrespective of the operating system, environment or the the python version, you should run the following script to verify tensorflow is up and running.
# import TensorFlow
import tensorflow as tf
sess = tf.Session()
# Verify we can print a string
hello = tf.constant("Hello UNP from TensorFlow")
print(sess.run(hello))
# Perform some simple math
a = tf.constant(20)
b = tf.constant(22)
print('a + b = {0}'.format(sess.run(a + b)))
Once this code runs and prints the output successfully, congratulations! You have successfully installed tensorflow. Let’s move to the next section to build our first application
Before Building your first tensorflow model
Tensors
The following are the three types of tensors we need to learn before getting started.
Type | Description |
Constant | Constant values |
Variable | Values adjusted in the graph |
Placeholder | Used to pass data in the graph |
Before diving into the hands on, I just want to introduce a few more terms in the tensorflow terminologies along with their meanings.
- Rank – Dimensionality of a tensor
- Shape – Shape of a tensor. Related to rank
The above diagram should help in understanding. Below we have the different data types in supported by Tensorflow.
Note: Quantitized values [qint8, qint16 and quint8] are special values for tensorflow that help reduce the size of the data. In fact, Google has gone to the extent of introducing Tensorflow Processing Units (TPUs) to speed up computation by leveraging quantitized values
Prepare the Data
We will quickly generate some data to get started. In this case, we will generate house size to predict house prices. The goal here isn’t to build a sophisticated house price predictor, but to get things moving in tensorflow in the simplest possible way.
We will generate some data using the below python code –
import tensorflow as tf import numpy as np import math import matplotlib.pyplot as plt # generation some house sizes between 1000 and 3500 (typical sq ft of house) num_house = 160 np.random.seed(42) house_size = np.random.randint(low=1000, high=3500, size=num_house ) # Generate house prices from house size with a random noise added. np.random.seed(42) house_price = house_size * 100.0 + np.random.randint(low=20000, high=70000, size=num_house) # Plot generated house and size plt.plot(house_size, house_price, "bx") # bx = blue x plt.ylabel("Price") plt.xlabel("Size") plt.show()
This generates the below output [This is a plot of the generated data]
Next, we are going to normalize the data. This helps in bringing the data in the same scale which in turn can lead to faster convergence.
We also split it into train and test data as part of the data science best practices. We will train our model on the training data and test our model on the test data to see how accurate our predictions are.
# you need to normalize values to prevent under/overflows.
def normalize(array):
return (array - array.mean()) / array.std()
# define number of training samples, 0.7 = 70%.
# We can take the first 70% since the values are randomized
num_train_samples = math.floor(num_house * 0.7)
# define training data
train_house_size = np.asarray(house_size[:num_train_samples])
train_price = np.asanyarray(house_price[:num_train_samples:])
train_house_size_norm = normalize(train_house_size)
train_price_norm = normalize(train_price)
# define test data
test_house_size = np.array(house_size[num_train_samples:])
test_house_price = np.array(house_price[num_train_samples:])
test_house_size_norm = normalize(test_house_size)
test_house_price_norm = normalize(test_house_price)
Conclusion
I hope this sets the expectations for what is about to come. In the next post we will build our first TensorFlow model.
In the next part, we will finally be ready to train our first tensorflow model on house prices. It will give us our first hands on experience with tensorflow!
For original blog, click here