The Tale of Kubernetes!

vijay Last Updated : 26 May, 2022

10 min read

This article was published as a part of the Data Science Blogathon.

Introduction

Kubernetes popularly known as (K8s) is a system for automating deployment, scaling, and managing containerized applications.

An application’s containers are grouped into logical units, which can be easily managed and discovered. A key element of Kubernetes is 15 years’ experience running production workloads at Google in conjunction with best-of-breed ideas and practices from the community.

As part of the microservices architecture, Kubernetes represents a step away from monolithic architectures, where services are decoupled, isolated, and only as big as they need to be. Developed in the form of containers, these microservices are launched in seconds and can be terminated after only minutes of use.

If you’re going to unbundle and containerize your monolith, why bother when the end result is near-identical to the user? It’s nearly as easy to maintain and develop as deploying. Microservices offer a number of advantages over monolithic architectures, such as simplified configuration management, more precise allocation of resources, and improved performance monitoring.

We will go over everything you need to know about preparing your applications for life inside cloud containers in this guide.

Essential of Kubernetes Architecture

https://tinyurl.com/2tp33ydc

Containers: In general, a container is an isolated “micro-application” with all the necessary software packages and libraries.

Pods: Pods group together two or more containers and are a key part of the K8s architecture. If a pod fails, K8s can automatically replace it, add more CPU and memory, and even replicate it to scale out. Pods are assigned IP addresses. A “controller” manages a group of pods, which together form a scalable “workload.” The pods are connected via “services,” which represent the entirety of the workload. In spite of scaling or destruction of some pods, a service balances web traffic across the pods. It’s also important to note that storage volumes are attached to pods and are exposed to containers within those pods.

Controller: A controller is a control loop that compares the desired and actual states of the K8s cluster and connects with the API server to create, update, and delete the resources it manages. The label selectors in the controller determine the set of pods that the controller may control. Replication Controller (scales pods), DeamonSet Controller (ensures each node gets one copy of a designated pod), Job Controller (runs software at a specific time), Cronjob Controller (schedules jobs that run periodically), StatefulSet Controller (controls stateful applications), and Deployment Controller are all examples of K8s controllers (controls stateless applications).

Node: A pod can be scheduled on a node, a physical or virtual server. A container runtime, a kubelet pod, and a Kube-proxy pod are all installed on every node (more on those three items to come). Manually and automatically scaling node groups (also called autoscaling groups or node pool groups) is possible.

Volumes: K8s storage volumes allow long-term data storage that is accessible throughout the pod’s lifespan. A pod’s containers can share a storage volume. A node can also access a storage volume directly.

Services: A Kubernetes service is a collection of pods that cooperate. An example of a K8s service is shown below.

In a service configuration file, a “label selector” defines a collection of pods that make up a single service. The service feature gives the service an IP address and a DNS name, and it round-robins traffic to addresses that match the label selector. This approach effectively allows the frontend to be “decoupled” (or abstracted) from the backend.

Kube-proxy: The Kube-proxy component is installed on each node and is responsible for maintaining network services on worker nodes. It is also in charge of maintaining network rules, allowing network communication between services and pods, and routing network traffic.

Kubelet: A kubelet runs on each node and sends information to Kubernetes about the state and health of containers.

Container Runtime: The container runtime is the software that allows containers to run. Containers, Docker, and CRI-O are popular container runtime examples.

Controller Plane: A cluster’s main controlling unit is the Kubernetes master. It manages the entire K8s cluster, including workloads, and serves as the cluster’s communication interface. The Kubernetes controller plane is made up of several parts. These components enable Kubernetes to run highly available applications. The following are the primary components of the Kubernetes control plane :

Etcd: etcd stores the cluster’s overall configuration data (i.e., the state and details of a pod), thereby representing the cluster’s state in Kubernetes master nodes. The API Server monitors the cluster and makes changes to the cluster to match the desired state set using etcd data.
API Server: JSON data is sent over HTTP. It serves as Kubernetes’s internal and external interface. It processes requests, validates them, and instructs the appropriate service or controller to update the state object in etcd, as well as allowing users to configure workloads across the cluster.
Scheduler: evaluates nodes to determine where an unscheduled pod should be placed based on CPU and Memory requests, policies, label affinities, and workload data locality

Controller Manager: The controller manager is a single process that oversees all of Kubernetes’ controllers. While the controllers are logically separate processes, they are run as a single process in a DaemonSet to simplify things.

Prerequisites

Hardware Requirements

One or more machines running one of:

Ubuntu 16.04+

Debian 9

CentOS 7

RHEL 7

Fedora 25/26 (best-effort)

HypriotOS v1.0.1+

Container Linux (tested with 1800.6.0)

Minimal required memory & CPU (cores)

The master node’s minimal required memory is 2GB and the worker node needs a minimum is 1GB

The master node needs at least 1.5 and the worker node needs at least 0.7 cores.

Cluster Setup

To manage your cluster you need to install kubeadm, kubelet, and kubectl.

kubeadm: the command to bootstrap the cluster.

kubelet: the component that runs on all of the machines in your cluster and does things like starting pods and containers.

kubectl: the command-line tool to talk to your cluster.

Steps to follow

Configure IP Tables
Disable SWAP
Install Docker & configure
Install Kubeadm-Kubelet & Kubectl
Create Default Audit Policy
Install NFS Client Drivers

Key Features of Kubernetes

https://tinyurl.com/yc8rdrhh

Service Discovery and Load Balancing: Kubernetes assigns a single DNS name and an IP address to each group of pods. When there is a high volume of traffic, K8s automatically balance the load across a service that may include multiple pods.

Automated Rollouts and Rollbacks: K8s can deploy new pods and swap them with existing pods. It can also change configurations without affecting end users. Kubernetes also includes an automated rollback feature. If a task is not completed successfully, this rollback functionality can undo the changes.

Secret and Configuration Management: Configuration and secret information can be securely stored and updated without the need to rebuild images. Stack configuration does not necessitate the disclosure of secrets, reducing the risk of data compromise.

Storage Orchestration: K8s can be used to mount a variety of storage systems, including local storage, network storage, and public cloud providers.

Automatic Bin Packing: Kubernetes algorithms attempt to allocate containers efficiently based on configured CPU and RAM requirements. This function assists businesses in optimizing resource utilization.

Self-Healing: Kubernetes performs health checks on pods and restarts containers that fail. If a pod fails, K8s does not allow connections to it until the health checks are completed.

Difference Between Kubernetes vs Docker

Kubernetes	Docker Swarm
Complex installation	Easier installation
More complex with a steep learning curve, but more powerful	Lightweight and easier to learn but limited functionality
Supports auto-scaling	Manual scaling
Built-in monitoring	Needs third-party tools for monitoring
Manual setup of the load balancer	Autoload balancer
Need for a separate CLI tool	Integrated with Docker CLI
Supports 2000 > nodes	Supports up to 5000 nodes
Public cloud service provider – Azure only	Public cloud service providers – Google, Azure, and AWS
Speed is considered for the strong cluster states.	Offers container deployment and scaling, even in large clusters without considering the speed.
Used by Spotify, Pinterest, eBay, Twitter, etc.	Used by 9GAG, Intuit, Buffer, Evernote, etc.

Real-World Case Studies

Babylon

Challenge

Babylon’s products greatly rely on machine learning and artificial intelligence, and in 2019, there wasn’t enough computing power at Babylon to run a particular experiment. As well as growing (from 100 to 1,600 in three years), the company planned to expand to other countries.

Solution

The infrastructure team at Babylon moved its Kubernetes-based user-facing applications to Kubeflow in 2018, a toolkit that enables machine learning on Kubernetes. We designed a Kubernetes core server, we deployed Kubeflow, and we orchestrated the whole experiment, which turned out very well,” says AI Infrastructure Lead Jérémie Vallée. A self-service platform for AI training is being developed on top of Kubernetes.

Impact

As opposed to waiting hours or days for access to computing resources, teams can access them instantly. It used to take 10 hours for a clinical validation to be completed; now it takes under 20 minutes. Additionally, Babylon’s cloud-native platform has enabled it to expand internationally.

Booking.com

Challenge

Booking.com implemented OpenShift in 2016, which allowed product developers to access infrastructure faster. The infrastructure team became a “knowledge bottleneck” when challenges arose because Kubernetes was abstracted from the developers. The ability to scale support became unsustainable.

Solution

The platform team decided, after using OpenShift for a year, to build their own vanilla Kubernetes platform, and ask developers to learn some Kubernetes in order to use it. Ben Tyler, Principal Developer, B Platform Track, says “This is not a magical platform.”. You can’t use it just by following the instructions. Developers need to learn, and we’re going to do everything we can to help them learn.

Impact

There has been an uptick in Kubernetes adoption despite its learning curve. It could take hours for developers to create a new service before containerization, or weeks if they didn’t know Puppet. Using the new platform, it takes as little as 10 minutes. Over 500 new services have been developed in the first eight months.

Spotify

Challenge

With over 200 million monthly active users worldwide, the audio-streaming platform was launched in 2008. Jai Chakrabarti, Director of Engineering, Infrastructure, and Operations, says, “We aim to empower creators and enable an immersive listening experience for all of our consumers today and into the future.” Spotify, an early adopter of microservices and Docker, had containerized microservices running across its fleet of VMs using Helios, a homegrown container orchestration system. “Having a small team working on the features was just not as efficient as adopting something that was supported by a much larger community,” he says, by late 2017.

Solution

“We saw the incredible community that had grown up around Kubernetes and wanted to be a part of that,” Chakrabarti says. Kubernetes offered more features than Helios. Furthermore, “we wanted to benefit from increased velocity and lower costs, as well as align with the rest of the industry on best practices and tools.” Simultaneously, the team wished to contribute its knowledge and influence to the thriving Kubernetes community. The migration, which would take place concurrently with Helios, should go smoothly because “Kubernetes fits very nicely as a compliment and now as a replacement to Helios,” according to Chakrabarti.

Impact

The team spent much of 2018 addressing the core technology issues required for a migration, which began late that year and is a major focus for 2019. “A small percentage of our fleet has been migrated to Kubernetes, and we’ve heard from our internal teams that they have less of a need to focus on manual capacity provisioning and more time to focus on delivering features for Spotify,” says Chakrabarti. According to Site Reliability Engineer James Wen, the largest service currently running on Kubernetes handles approximately 10 million requests per second as an aggregate service and benefits greatly from autoscaling. “Previously, teams would have to wait for an hour to create a new service and get approval,” he adds.

Adidas

Challenge

In recent years, the Adidas team was pleased with its technological choices—but access to all of the tools was a challenge. “Just to get a developer VM,” says Daniel Eichten, Senior Director of Platform Engineering, “you had to send a request form, give the purpose, give the title of the project, who’s responsible, give the internal cost center a call so that they can do recharges.” “In the best-case scenario, you received your machine within half an hour. Worst-case scenario is a half-week or even a week.”

Solution

To improve the process, “we started from the developer point of view,” and looked for ways to shorten the time it took to get a project up and running and into the Adidas infrastructure, says Senior Director of Platform Engineering Fernando Cornago. They found the solution with containerization, agile development, continuous delivery, and a cloud-native platform that includes Kubernetes and Prometheus.

Impact

Six months after the project began, the Adidas e-commerce site was running entirely on Kubernetes. The e-commerce site’s load time was cut in half. The frequency of releases increased from every 4-6 weeks to 3-4 times per day. adidas now runs 40% of its most critical, impactful systems on its cloud-native platform, with 4,000 pods, 200 nodes, and 80,000 builds per month

Benefits

Using Kubernetes and its huge ecosystem can improve your productivity
Kubernetes and a cloud-native tech stack attract talent
Test and autocorrection of applications
Kubernetes is a future proof solution
Kubernetes helps to make your application run more stable
Kubernetes can be cheaper than its alternatives
It is developed by Google, which brings years of valuable industry experience to the table.
Largest community among container orchestration tools.
Offers a variety of storage options, including on-premises SANs and public clouds.
Adheres to the principles of immutable infrastructure.
Control and automate deployments and updates
Save money by optimizing infrastructural resources thanks to the more efficient use of hardware
Scale resources and applications in real-time
Solve many common problems derived from the proliferation of containers by organizing them in “pods”

Disadvantages

Kubernetes can be an overkill for simple applications
The transition to Kubernetes can be cumbersome
Limited functionality according to the availability in the Docker API.
Highly complex Installation/configuration process
Not compatible with existing Docker CLI and Compose tools
Complicated manual cluster deployment and automatic horizontal scaling setup

Kubernetes Ecosystem Glossary

Cluster

Is a set of machines individually referred to as nodes used to run containerized applications managed by Kubernetes.

Node

Is either a virtual or physical machine. A cluster consists of a master node and a number of worker nodes.

Cloud Container

Is an image that contains software and its dependencies.

Pod

Is a single container or a set of containers running on your Kubernetes cluster.

Deployment

Is an object that manages replicated applications represented by pods. Pods are deployed onto the nodes of a cluster.

Replicaset

Ensures that a specified number of pod replicas are running at one time.

Service

Describes how to access applications represented by a set of pods. Services typically describe ports and load balancers and can be used to control internal and external access to a cluster.

Containers: A standalone, executable package of software that includes all necessary code and dependencies.

Immutable Architecture: An infrastructure paradigm where servers are never modified, only replaced.

Infrastructure-as-Code: The practice of provisioning and managing data center resources using humanly readable declarative definition files (e.g., YAML).

Microservices: A series of independently deployable software services that, together, make up an application.

Vertical Scaling: Where you allocate more CPU or memory to your individual machines or containers.

Horizontal Scaling: Where you add more machines or containers to your load-balanced computing resource pool.

Conclusion

Kubernetes is a sophisticated system. It has, however, proven to be the most resilient, scalable, and performant platform for orchestrating highly available container-based applications, supporting decoupled and diverse stateless and stateful workloads, and providing automated rollouts and rollbacks.

There is no simple answer to whether or not adopting Kubernetes is the best option for you. It is dependent on your specific requirements and priorities, and many technical reasons are not even mentioned here. If you are starting a new project, working in a startup that wants to scale and develop more than just a quick MVP, or need to upgrade a legacy application, Kubernetes may be a good choice because it provides a lot of flexibility, power, and scalability. However, it always necessitates a time investment because new skills must be learned, and new workflows must be established in your development team.

However, if done correctly, investing the time to learn and adopt Kubernetes will often pay off in the future due to improved service quality, increased productivity, and a more motivated workforce.

In any case, you should make an informed decision, and there are numerous compelling reasons to use or avoid Kubernetes. I hope this post has assisted you in making the best decision for you.

The media shown in this article is not owned by Analytics Vidhya and is used at the Author’s discretion.

vijay

||📊Lead Technical Analyst at HP||Connecting data to dollar||📋Certified - Data Scientist||SAP ERP||Microsoft Data Analyst,LSS Yellow Belt||🥇3*MVP,1*APAC Champion@HP||🏆Blogathon Winner’22@Xebia,2*AVCC||AVCC Member’22||

Free Courses

4.7

Generative AI - A Way of Life

Explore Generative AI for beginners: create text and images, use top AI tools, learn practical skills, and ethics.

4.5

Getting Started with Large Language Models

Master Large Language Models (LLMs) with this course, offering clear guidance in NLP and model training made simple.

4.6

Building LLM Applications using Prompt Engineering

This free course guides you on building LLM apps, mastering prompt engineering, and developing chatbots with enterprise data.

4.8

Improving Real World RAG Systems: Key Challenges & Practical Solutions

Explore practical solutions, advanced retrieval strategies, and agentic RAG systems to improve context, relevance, and accuracy in AI-driven applications.

4.7

Microsoft Excel: Formulas & Functions

Master MS Excel for data analysis with key formulas, functions, and LookUp tools in this comprehensive course.

MUID

Used by Microsoft Clarity, to store and track visits across websites.

Expiry: 1 Year

Type: HTTP

_clck

Used by Microsoft Clarity, Persists the Clarity User ID and preferences, unique to that site, on the browser. This ensures that behavior in subsequent visits to the same site will be attributed to the same user ID.

Expiry: 1 Year

Type: HTTP

_clsk

Used by Microsoft Clarity, Connects multiple page views by a user into a single Clarity session recording.

Expiry: 1 Day

Type: HTTP

SRM_I

Collects user data is specifically adapted to the user or device. The user can also be followed outside of the loaded website, creating a picture of the visitor's behavior.

Expiry: 2 Years

Type: HTTP

SM

Use to measure the use of the website for internal analytics

Expiry: 1 Years

Type: HTTP

CLID

The cookie is set by embedded Microsoft Clarity scripts. The purpose of this cookie is for heatmap and session recording.

Expiry: 1 Year

Type: HTTP

SRM_B

Collected user data is specifically adapted to the user or device. The user can also be followed outside of the loaded website, creating a picture of the visitor's behavior.

Expiry: 2 Months

Type: HTTP

_gid

This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected includes the number of visitors, the source where they have come from, and the pages visited in an anonymous form.

Expiry: 399 Days

Type: HTTP

_ga_#

Used by Google Analytics, to store and count pageviews.

Expiry: 399 Days

Type: HTTP

_gat_#

Used by Google Analytics to collect data on the number of times a user has visited the website as well as dates for the first and most recent visit.

Expiry: 1 Day

Type: HTTP

collect

Used to send data to Google Analytics about the visitor's device and behavior. Tracks the visitor across devices and marketing channels.

Expiry: Session

Type: PIXEL

AEC

cookies ensure that requests within a browsing session are made by the user, and not by other sites.

Expiry: 6 Months

Type: HTTP

G_ENABLED_IDPS

use the cookie when customers want to make a referral from their gmail contacts; it helps auth the gmail account.

Expiry: 2 Years

Type: HTTP

test_cookie

This cookie is set by DoubleClick (which is owned by Google) to determine if the website visitor's browser supports cookies.

Expiry: 1 Year

Type: HTTP

_we_us

this is used to send push notification using webengage.

Expiry: 1 Year

Type: HTTP

WebKlipperAuth

used by webenage to track auth of webenagage.

Expiry: Session

Type: HTTP

ln_or

Linkedin sets this cookie to registers statistical data on users' behavior on the website for internal analytics.

Expiry: 1 Day

Type: HTTP

JSESSIONID

Use to maintain an anonymous user session by the server.

Expiry: 1 Year

Type: HTTP

li_rm

Used as part of the LinkedIn Remember Me feature and is set when a user clicks Remember Me on the device to make it easier for him or her to sign in to that device.

Expiry: 1 Year

Type: HTTP

AnalyticsSyncHistory

Used to store information about the time a sync with the lms_analytics cookie took place for users in the Designated Countries.

Expiry: 6 Months

Type: HTTP

lms_analytics

Used to store information about the time a sync with the AnalyticsSyncHistory cookie took place for users in the Designated Countries.

Expiry: 6 Months

Type: HTTP

liap

Cookie used for Sign-in with Linkedin and/or to allow for the Linkedin follow feature.

Expiry: 6 Months

Type: HTTP

visit

allow for the Linkedin follow feature.

Expiry: 1 Year

Type: HTTP

li_at

often used to identify you, including your name, interests, and previous activity.

Expiry: 2 Months

Type: HTTP

s_plt

Tracks the time that the previous page took to load

Expiry: Session

Type: HTTP

lang

Used to remember a user's language setting to ensure LinkedIn.com displays in the language selected by the user in their settings

Expiry: Session

Type: HTTP

s_tp

Tracks percent of page viewed

Expiry: Session

Type: HTTP

AMCV_14215E3D5995C57C0A495C55%40AdobeOrg

Indicates the start of a session for Adobe Experience Cloud

Expiry: Session

Type: HTTP

s_pltp

Provides page name value (URL) for use by Adobe Analytics

Expiry: Session

Type: HTTP

s_tslv

Used to retain and fetch time since last visit in Adobe Analytics

Expiry: 6 Months

Type: HTTP

li_theme

Remembers a user's display preference/theme setting

Expiry: 6 Months

Type: HTTP

li_theme_set

Remembers which users have updated their display / theme preferences

Expiry: 6 Months

Type: HTTP

Reading list

Basics of Machine Learning

Machine Learning Lifecycle

Importance of Stats and EDA

Understanding Data

Probability

Exploring Continuous Variable

Exploring Categorical Variables

Missing Values and Outliers

Central Limit theorem

Bivariate Analysis Introduction

Continuous - Continuous Variables

Continuous Categorical

Categorical Categorical

Multivariate Analysis

Different tasks in Machine Learning

Build Your First Predictive Model

Evaluation Metrics

Preprocessing Data

Linear Models

KNN

Selecting the Right Model

Feature Selection Techniques

Decision Tree

Feature Engineering

Naive Bayes

Multiclass and Multilabel

Basics of Ensemble Techniques

Advance Ensemble Techniques

Hyperparameter Tuning

Support Vector Machine

Advance Dimensionality Reduction

Unsupervised Machine Learning Methods

Recommendation Engines

Improving ML models

Working with Large Datasets

Interpretability of Machine Learning Models

Automated Machine Learning

Model Deployment

Deploying ML Models

Embedded Devices

The Tale of Kubernetes!

Introduction

Essential of Kubernetes Architecture

Prerequisites

Key Features of Kubernetes

Difference Between Kubernetes vs Docker

Real-World Case Studies

Benefits

Disadvantages

Kubernetes Ecosystem Glossary

Conclusion

Login to continue reading and enjoy expert-curated content.

Free Courses

Generative AI - A Way of Life

Getting Started with Large Language Models

Building LLM Applications using Prompt Engineering

Improving Real World RAG Systems: Key Challenges & Practical Solutions

Microsoft Excel: Formulas & Functions

Recommended Articles

Responses From Readers

Write for us

Analytics Vidhya (4)

brahmaid

csrftoken

Identityid

sessionid

Google (1)

g_state

Microsoft (7)

MUID

_clck

_clsk

SRM_I

SM

CLID

SRM_B

Google (7)

_gid

_ga_#