Hadoop:
Hadoop is an open-source programming framework for putting away information and running applications on bunches of product equipment. It gives monstrous capacity to any sort of information, tremendous preparing power and the capacity to handle for all intents and purposes boundless assignments or employments.
Components Of Hadoop:
At present, four center modules are incorporated into the essential structure from the Apache Foundation:
Hadoop Common: the libraries and utilities utilized by other Hadoop modules.
Hadoop Distributed File System (HDFS): the Java-based versatile framework that stores information over various machines without earlier association.
MapReduce: a product programming model for preparing huge arrangements of information in parallel.
YARN: asset administration system for planning and taking care of asset solicitations from appropriated applications. (YARN is an acronym for Yet Another Resource Negotiator.)
Other software components that can run on top of or alongside Hadoop and have achieved top-level Apache project status include:
Hadoop is an open-source programming framework for putting away information and running applications on bunches of product equipment. It gives monstrous capacity to any sort of information, tremendous preparing power and the capacity to handle for all intents and purposes boundless assignments or employments.
Components Of Hadoop:
At present, four center modules are incorporated into the essential structure from the Apache Foundation:
Hadoop Common: the libraries and utilities utilized by other Hadoop modules.
Hadoop Distributed File System (HDFS): the Java-based versatile framework that stores information over various machines without earlier association.
MapReduce: a product programming model for preparing huge arrangements of information in parallel.
YARN: asset administration system for planning and taking care of asset solicitations from appropriated applications. (YARN is an acronym for Yet Another Resource Negotiator.)
Other software components that can run on top of or alongside Hadoop and have achieved top-level Apache project status include:
- Pig: A stage for controlling information put away in HDFS that incorporates a compiler for MapReduce programs and an abnormal state dialect called Pig Latin. It gives an approach to perform information extractions, changes and stacking, and fundamental examination without writing MapReduce programs.
- Hive: An information warehousing and SQL-like question dialect that presents information as tables. Hive writing computer programs is like database programming. (It was at first created by Facebook.)
- HBase: A nonrelational, disseminated database that keeps running on top of Hadoop. HBase tables can serve as information and yield for MapReduce occupations.
- HCatalog: A table and capacity administration layer that helps clients share and get to information.