Training an AdaBoost ensemble in its purest form is hard to parallelize. The problem is that the weighting of the examples in the next iteration of the algorithm depends on the performance of the previous iteration, so a new iteration cannot start before the previous one finishes. Classification with an AdaBoost ensemble, on the other hand, is easily parallelized: each weak classifier can work independently of the others, so it can run on its own thread or on a remote machine. I've skimmed the paper suggested by Stephane Genaud, and it seems the authors experiment with a variation of the algorithm, not with its original formulation (they create a number of workers and a master; the workers each train a number of classifiers and return the best one to the master, which builds the final ensemble).
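A minimal sketch of the parallel classification step described above, assuming a trained ensemble given as hypothetical lists `stumps` (weak classifiers with a `predict` method returning +1/-1) and `alphas` (their weights); the names and threading choice are mine, not from the original answer.

from concurrent.futures import ThreadPoolExecutor
import numpy as np

def parallel_predict(stumps, alphas, X):
    # Each weak classifier is independent of the others, so its predictions
    # can be computed on its own thread (or shipped to a remote worker).
    with ThreadPoolExecutor() as pool:
        votes = list(pool.map(lambda clf: clf.predict(X), stumps))
    # Combine the weak predictions by the usual weighted majority vote.
    score = sum(a * v for a, v in zip(alphas, votes))
    return np.sign(score)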
To parallelize an algorithm, you need to understand its nature and see which operations are dependent and which are independent. Identifying the data parallelism, control parallelism, and temporal parallelism in the execution of the algorithm will help you understand the different ways in which it can be parallelized; the sketch below illustrates the first of these. The "best" way depends on the machine you are targeting. Textbooks on parallel programming can help you further, and there may also be publications on parallelizing classifiers that could get you started. Wish you all the best.
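A small illustrative sketch (my own example, not from this answer) of data parallelism: the data is split across workers while every worker runs the same operation, here counting misclassified examples per chunk.

from concurrent.futures import ProcessPoolExecutor
import numpy as np

def chunk_errors(args):
    # Same operation applied to each chunk: count misclassified examples.
    y_true, y_pred = args
    return int(np.sum(y_true != y_pred))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    y_true = rng.integers(0, 2, 10_000)
    y_pred = rng.integers(0, 2, 10_000)
    # Split the data into four chunks, one per worker.
    chunks = list(zip(np.array_split(y_true, 4), np.array_split(y_pred, 4)))
    with ProcessPoolExecutor(max_workers=4) as pool:
        error_rate = sum(pool.map(chunk_errors, chunks)) / len(y_true)
    print(error_rate)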
I am not familiar with the AdaBoost algorithm. However, if you want a practical way to parallelize an algorithm efficiently, you have to fully understand its nature and the different ways it can be implemented. The easiest route is to use the parallel primitives in common libraries, which are usually well implemented; a sketch follows.
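A minimal sketch of that "use the library's parallel primitives" advice, using joblib.Parallel, a common choice in the Python ecosystem; the data and the weak learners here are illustrative assumptions, not part of the original answer.

from joblib import Parallel, delayed
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=1000, random_state=0)

def fit_one(seed):
    # Fit one independent weak learner; independent fits parallelize trivially.
    return DecisionTreeClassifier(max_depth=1, random_state=seed).fit(X, y)

if __name__ == "__main__":
    # n_jobs=-1 lets joblib use all available cores.
    models = Parallel(n_jobs=-1)(delayed(fit_one)(s) for s in range(8))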
The parallelization scheme is based on distributing the classifiers over the processors. A master owns all the classifiers and distributes them to slaves, each of which computes its assigned classifiers using a local copy of the training dataset.
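A minimal sketch of that master/worker scheme under stated assumptions: the "classifiers" are scikit-learn estimators identified here by a hypothetical per-classifier parameter, and each worker process fits its assigned ones against its own copy of the training data.

from multiprocessing import Pool
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

# Training data; each worker process ends up with a local copy.
X, y = make_classification(n_samples=1000, random_state=0)

def fit_assigned(depth):
    # Worker: compute the classifier it has been assigned on local data.
    return DecisionTreeClassifier(max_depth=depth).fit(X, y)

if __name__ == "__main__":
    # Master: owns the list of classifiers and distributes them to workers.
    assignments = [1, 2, 3, 4]  # one illustrative parameter per classifier
    with Pool(processes=4) as pool:
        fitted = pool.map(fit_assigned, assignments)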