Cluster management tools – Big Data

Big Data application and resource managers Hadoop Map-Reduce is a distributed resource manager and data processing. Provides a scheduling infrastructure that provides algorithms for performing the distributed calculations. YARN is an operating data system and distributed resource Manager. Evolution of Map-Reduce. It can run on Linux and Windows. Standalone is an operating data system and…

Read More »

Big data-security tools, machine learning, labelling,…

Security Tools Apache Ranger is a framework to enable, monitor and manage comprehensive data security across the Hadoop platform. Apache Sentry is a system for applying functionality-based authorization of fine granularity to data and metadata stored in a Hadoop cluster. Knox is a Gateway application to interact with the REST API and the Apache Hadoop…

Read More »