Join us
@okzapradhana ă» Sep 27,2022 ă» 12 min read ă» 1505 views
Few explanations about Docker, differences between Image and Container and Overview of Kubernetes Concept
Long story before containerization arises, we tend to use VMs approach on deploying applications. Every machine will have their OS with its libraries, makes the apps size is pretty huge as its contains OS. The diagram is closely like below:
VM runs on top of Hypervisor, if you have heard of VMware, Virtual Box, Hyper-V that are some examples of hypervisor. Each app is isolated within VM alongside with the OS and this is similar to container, the fundamentals that must understand by developers nowadays.
Basic concept of container is similar to container in real life, which is the way you package everything into one. Like this cargo container below:
You might often hearing these questions from your colleagues:
My app works smoothly on your laptop, but why it doesnât work on my machine?
Most likely they havenât implemented containerization on their applications that forces them to do installations step by step every time they want to test the app in local machine. Moreover the installations of its dependencies are differ on each Operating Systems. Letâs say that when you install NodeJS in Windows has different steps when you install it on Linux.
But with container, we wonât get bother by doing such repetitive things that will save us much time and could focus on the development instead.
More specifically, container wraps and package the code, dependencies, configurations in one file and makes it isolate from each other and with the OS. Though they are isolated but they can communicate by networks which is the happy part of containerzation!
As we saw before that VMs run on top of Hypervisor meanwhile container runs on top of some sort of it we call it as container runtime. There are some container runtimes but the most popular is Docker.
In world of virtualization and container, actually the things discussed above is more considered as definition of Image. Then whatâs the difference between image and container?
To get easier on understanding both terms, we can use Object Oriented Programming analogy, think of Image as Class and container as Object. We can instantiate many Objects from a Class, and so does container. We could create and run many containers from an image.
Then in short we could say that actually containers is image that running.
Letâs move straight to Docker, one of container runtime that popular among developers.
There is a brief explanation from Wikipedia about Dockerâs definition:
Docker is set of platform as a service products that use OS-level virtualization to deliver software in packages called containers.
Wikipedia (source)
With Docker, you can deliver your apps much easier despite what OS you used when development. Good things is Docker is available in Windows, Mac OS and Linux. This article wonât cover the installation step so if you insist to try on your machine, kindly follow these steps and choose based on your OS.
Just like containerized app diagram, Docker play its role as container runtime that manages the containers app and runs on top of Operating System.
Some key features when running container images in Docker â called as Docker Engine are:
These are written on Official Dockerâs page. Thatâs all about Docker and letâs go to practices section.
First of all, to interact with Docker we will use Docker CLI that are included on your Docker installation. To ensure that Docker has been installed successfully, check its version by executing docker --version
on your terminal:
Then, we can try to pull images from docker hub , letâs take Postgres Official Image as an example. To pull the image, run docker pull postgres:13.5
If you havenât pull the image, then it will download from docker hub then store it to your local image cache. To ensure that the image has been pulled successfully, execute docker images
on your terminal:
As you might see there is postgres
image with tag 13.5, but you might be confused as well why there is another postgres
image. If you looked at it more details, the other image has different tag that is 13-alpine
.
Every docker image has tag, consider this tag as versioning system. So we could say that now we have postgres installed in our machine that acts as Image with version 13.5, fortunately Postgres provide us the docs that straight to version 13.5, kindly check here.
In order to running system/app/program , simply just need to run the pulled image â called it by Docker Container with docker run
command:
docker run --name <container-name> -e POSTGRES_PASSWORD=<pg-password> -d postgres:13.5
There are some parameters here:
--name
is argument that used to specify the container name when it's running. Good things, docker is excellent at recognizing each container by giving name that results from auto generate if we didn't specify the container name.-e
is argument to specify environment variable that want to override when running the image. Postgres image does support some of environment variables, go check them here.-d
is to run the image in detach, means that the container will run in background and you can do other stuffs while container is initiating and running.postgres:13.5
is the image name, don't forget to specify the tag after colon symbol :
otherwise it will recognize to use latest
image tag which some of part we don't want that to happened.docker run
also has more than those options, check them here
What docker run
actually did is create container from images then running them. For some purpose we could create the container only, and run them later by executing docker create
command.
The definition of docker create
from Docker Official Documentations
The docker create command creates a writeable container layer over the specified image and prepares it for running the specified command
Docker Official Documentation (source)
Then create the container from image with:
docker create --name <container-name> -t -i -e POSTGRES_PASSWORD=<pg-password> postgres:13.5
To learn more about doker create
please go here.
After you executed the command it will return container ID as the STDOUT in your terminal. To ensure that the container has been created, use docker ps -a
You might see that the STATUS
of container is Created
means that the container has been created, but its not running yet. To run it, use docker start
docker start <container-name>|<container-id>
This command will instruct Docker server to run the created container into Up
status, or Exited
if there is error. Run docker ps
again to check:
The postgres container state now changed from Created
to Up
for 6 minutes in this case, and the Ports is allocated as well.
We already know that the container has been running but how to see whatâs the container doing back there? This is where logging play its role, Docker has provide us the command to see our containerâs log by using docker logs
docker logs <container-name>|<container-id>
Here is the output would like:
Since we used Docker Image of Database tools, make sure the container logs showing that they are ready to accept connections before you did any transactions.
For some reason, you might want to stop the container as it uses your machine resource as long as the container is running. You can do it easily by docker stop
docker stop <container-name>|<container-id>
This will put the container into Exited
status but still remains in container list when we execute docker ps -a
, to completely remove it we can use docker rm
command like below:
docker rm <container-name>|<container-id>
Note that you canât remove the container unless you stop it first otherwise it will lead you to error:
Though itâs possible to force remove but itâs not kind of thing thatâs recommend to do.
Itâs also possible to execute a command inside running container to know whatâs actually inside it. To do that, use docker exec
docker exec -it <container-name>|<container-id> <command>
To give you more understanding on this command, letâs try to explain some of its arguments:
-i
is same as interactive
that will help us to interact with the container as they provide STDIN to the user. If we remove the -i
argument then we won't get any feedback as our STDIN can't be processed by the container.
-t
in exec helps to produce pretty output to the user that interacts with the container. If we remove this argument then we will think that the container is hanging at first, while they are waiting for our input.
bash
is one of common command that you can choose if you want to run bash script inside the container.
docker exec
often helps us to understand what are things inside a container as we might having blurry of understanding what's container actually is. Also, we could interact with the running container as what we can do in UNIX terminal.
Refer here for docker exec
reference
Though there are still other topics related to Docker i.e Volume, Networking, Docker Compose but in this post I tried to give the basic things when we learning Docker for first time.
Okay, so we already learned about Docker in introduction. It provides you easiness to deploy and deliver your apps without worrying of setup and installing libraries, however using Docker only ainât scalable at all. And here comes Kubernetes to scale our apps!
From Kubernetes.io documentation, the definition is:
Kubernetes is a portable, extensible, open-source platform for managing containerized workloads and services, that facilitates both declarative configuration and automation
Kubernetes Official Documentation (source)
To put it simply, kubernetes is open source platform to managed containerized application. Itâs also known as k8s because there are 8 letters between k and s.
So, howâs k8s works and its architecture?
From architecture above, it gives us some terms that relates to k8s components, they are:
Node is actually a running virtual machine that resides in kube cluster, and we could scale the node vertically and horizontally. Vertical scaling means that we upgrade the specification of node i.e its CPU, memory, or storage but speaking the facts that horizontally scale has limit and at certain point it canât be scale in horizontal way anymore. So, in the end itâs recommend to do horizontal scaling because it has no limit as long as we can afford the price. Horizontal scaling means that we spawn new nodes in the cluster to share the loads with another node, usually we face this case when there is high traffic or some data needs to be processed in parallel thus we need more nodes by using data processing concept like Map Reduce.
Speaking about node, it has some components that we need to know about:
The first thing I noted when learning Kubernetes is about their hierarchial components and itâs helping me to have general understanding about k8s. I could say that:
In hierarchial, k8s has one or more cluster, each cluster has many nodes. Each node has one or more pods, each pod has one or more containers. Container terms in k8s has exactly same meaning with container in Docker that we discussed before, thatâs why we need to understand first about container.
Thatâs all for introduction to Kubernetes, letâs put more focus on the hands on session.
When reading about k8s you might got a question in mind. Do we have to setup the cluster from scratch, or using cloud like Google Kubernetes Engineor Amazon Kubernetes Service for the sake of learning and exploring kube?Happily, we donât need to do both. The community has developed âmini clusterâ to facilitate us in terms of learning k8s in practical way.
There are some options and personally I didnât have recommendation on which is best to use but Iâm using minikube until this article is written, and this post will using minikube as our demo.
Minikube helps us to setup local cluster with single node and already supported in Windows, Linux, and macOS. To install it on your machine, take a look on their docs here.
Ensure that minikube has been installed by checking its version with minikube version
While minikube acts to spawning cluster with its node, we need a way to communicate to the cluster and thatâs kubectl
plays the part!
kubectl is tools developed by community to interact with kube-apiserver. Remember that kube-apiserver is k8s front end, it means that we can interact with kubernetes cluster using kubectl
. If you havenât install it on your machine, please follow this guide and install it based on your Operating System.
Once you have kubectl
in your local, be sure to check itâs version by typing kubectl version
in terminal.
We could see wo informations in the output:
Now we have minikube and kubectl in our machine, whatâs next? In kubernetes, there are some objects/resources that we need to configure using kubectl
, actually you might not need to use all of them in your apps/projects but having knowledge on it surely will help you a lot and gives you reasoning on when we use certain object and why we use it instead of others.
To help you understand about kube objects, here I got some articles for you:
If you are looking for tutorial in Video version, I have some recommendations to take a look. Donât forget to support their channel as well!
After you feel more familiar with kubectl
and kube objects, I found cheatsheet provided by k8s docs that might be help you later
Finally we reached the end of this article, but donât worry we will continue this with implementing Kubernetes case in Airflow in Part II!
Join other developers and claim your FAUN account now!
Influence
Total Hits
Posts
Only registered users can post comments. Please, login or signup.