Ansble Task 11.1

4 min readApr 3, 2021

Ansble Task 11.1

Hadoop and Ansible…?

What is Ansible?

Ansible is an open-source automation tool. It is used for automating configuration management, cloud provisioning, application deployment, intra-service orchestration, and other IT tasks. It has its own declarative language to describe system configuration.

Ansible Playbook:

Ansible playbook is an ordered list of tasks, saved so you can run those tasks in that order repeatedly. Playbooks include variables as well as tasks. Playbooks are written in YAML and are easy to read, write, share and understand.

Some common terminologies:-

Control Node: A control node is a Linux server that has Ansible installed on it and is used for managing remote hosts or nodes.
Managed Node: The network devices (and/or servers) you manage with Ansible. Managed nodes are also sometimes called “hosts”. Ansible is not installed on managed nodes.
Inventory: It’s a file that contains the list of the managed nodes. It contains their IP address, username, password, connection type, etc.

What is Hadoop?

Hadoop is the product of Apache. It is an open-source distributed processing framework that manages data processing and storage for big data applications in scalable clusters of computer servers. It’s written in java. It is designed to scale up from a single server to thousands of machines, each offering local computation and storage.

Some common terminologies:-

Namenode/Masternode: NameNode is the master node in the Apache Hadoop HDFS Architecture that maintains and manages the blocks present on the DataNodes/slave nodes.
Datanode/Slavenode: DataNodes are the slave nodes in HDFS. Datanodes are responsible for storing actual data.

Installation And Configuration-Ansible

Step 1: Install Ansible

Command: pip3 install ansible