How could YARN improve Hadoop scalability?

More Peipei Wang's questions See All

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

09 August 2024 7,718 0 View

Why does the MFDFA algorithm need to calculate the profile of the time series?

As described in the Multifractal detrended fluctuation analysis (MFDFA) algorithm, it at first calculates the profile of the time series, and then other steps are operated on the profile....

05 August 2024 9,366 2 View

Differences between deep seated landslides and slope destabilization?

Hi, Could someone explain the primary differences between deep-seated landslides and slope destabilization? In particular, definition and characteristics, mechanisms and triggering factors,...

02 August 2024 4,212 2 View

How to completely dissolve soy lecithin?

Hello, I'm looking to dissolve soy lecithin in order to create a liposome. I've tried using various solvents like pure ethanol, methanol, and a mixture of ethanol and methanol but they didn't...

02 August 2024 6,980 4 View

Have you heard about Technological Surveillance on Oceanographic Research Vessels?

It is important to highlight that technological surveillance on oceanographic vessels is a collaborative effort in which different specialists work together to effectively leverage technology in...

31 July 2024 1,633 0 View

A question about arbuscular mycorrhizal???

How long it takes for arbuscular mycorrhiza to establish and produce benefits under experimental conditions？

25 July 2024 5,208 2 View

Is it possible to run the AIMD within a system using virtual crystal approximation (VCA)?

I want to study the thermal properties of a mixed system which is constructed by virtual crystal approximation in VASP. When I try to run the ab initio Molecular Dynamics of this system in VASP, I...

19 July 2024 6,569 3 View

If I want to invent my own hypothesis testing method, where should I get started ?

15 July 2024 5,376 5 View

Recommendations for Rapid Publication Journals in Traffic and Transportation?

I am currently working on a research paper focused on the control of Connected and Autonomous Vehicles (CAVs) utilizing multi-agent reinforcement learning methods. At this stage, I am seeking a...

14 July 2024 2,620 2 View

I've earned 1 best paper award and 4 best oral presentation awards. What should I do next?

I've earned 1 best paper award and 4 best oral presentation awards. What should I do next to elevate my academic capabilities to the next level ?

14 July 2024 6,071 5 View

Which Scopus Journal provides the most affordable fees?

"PUBLISHING IN A SCOPUS JOURNAL" Researchers are now at a cross road. The critical need to publish in a Scopus or ISI, etc journal is ever vital. Journal Publication fees must be submitted....

10 August 2024 8,621 1 View

Seeking Advice on Viability and Execution of Undergraduate Thesis Topic?

Hello everyone, I am currently developing a thesis proposal and would appreciate your input on its viability and how to effectively carry it out. My proposed topic is: "Does the perceived threat...

10 August 2024 8,992 0 View

Who will be moral responsible for the death of thousands of people in the event of an earthquake?

Who will bear moral responsibility for the deaths of thousands of people in the event of an earthquake? Weeks and months remain before the onset of strong earthquakes that bring death to...

08 August 2024 6,134 12 View

Separation of organic acids-HPLC?

Hello What should be done to separate and identify organic acids in HPC when their RetTime is the same?Like oxalic acid with Propanoic Acid.or acids that have a very close RetTime.

07 August 2024 8,782 3 View

Are there any instruments for studying time similar to the way it is in space?

There are a huge number of methods for studying objects in space, according to the senses (and not only). Mechanical, thermal, optical, acoustic, electrical, magnetic, based on particle beams,...

06 August 2024 7,102 0 View

In the case of a wound l recurrence after radical breast cancer and sentinel lymph node biopsy. Are the sentinel lymph node procedure recommended?

In the case of a wound l recurrence after radical breast cancer and sentinel lymph node biopsy. Are the sentinel lymph node procedure recommended? If no axillary lymph node dissection was not...

05 August 2024 8,056 1 View

Regarding a model for simulating battery charge and discharge, what do you consider to be high fidelity?

Regarding a model for simulating battery charge and discharge, what do you consider to be high fidelity? What is the acceptable percentage of error (regardless of the metric)? Could you suggest...

03 August 2024 5,358 0 View

Interested in a SCOPUS collaboration?

Hi RG family. My team and I are working on some SCOPUS publications and we need co-authors who are willing and capable of undertaking both qualitative and quantitative-based studies. The scope...

02 August 2024 7,843 0 View

Which test should be used to study association among demographic profile and awarness level?

i have to study the awareness and adoption level of cloud computing in a district of India. i also want to use association among demographic variables like gender, age, education, income etc and...

02 August 2024 2,420 3 View

Interested in a SCOPUS collaboration?

Hi RG family. My team and I are working on some SCOPUS publications and we need co-authors who are willing and capable of undertaking both qualitative and quantitative-based studies. The scope of...

02 August 2024 8,572 0 View

Suzanne McIntosh

Yes, you are correct that the Resource Manager has cluster-wide responsibilities just as the Job Tracker had. However, the job-related responsibilities of the Job Tracker are largely handled by a new component - the Application Master. There is one Application Master per job - this is why YARN scales better than the previous Hadoop architecture. The Application Master for a given job can run on an arbitrary cluster node, and it runs until the job reaches termination.

Feras Awaysheh

Yarn enhanced Hadoop’s ecosystem by separating the Data processing engine (MR programming model) and resource management capabilities. MR was originally designed as a batch-oriented system, though; it becomes often used for other data analysis and processing types, which raised an emergent need for separating these two major functionalities. As a direct impact of loose coupling between cluster resource management and the application framework, the framework complexity was reduced, the system flexibility, scalability and performance were enhanced as well. Yarn become the paradigm backbone and responsible for scheduling users application tasks, implementing security controls and managing the computing resources, besides providing the high availability features of Hadoop

Peipei Wang

Both Suzanne McIntosh and Feras Awaysheh mentions that the master node is freed from the job management, and these freed resources could be used for management of more node. That does perfectly explain the improvements of the scalability.

I am now trying to understand why the number of nodes could be the factor limiting its scalability. Could it be the resource limitation of a single process since it keeps the state information of all slave nodes? If this is the case, will allocate the ResourceManager to a more powerful server help improve the scalability?