This depends on your capability with the tools. To be clear: Mahout is a machine learning library that lets you run a number of algorithms, but Mahout was built to work on top of Hadoop, which means that sooner or later you may need to write your own code, or modify existing code, to address specific problems. In practice, that means writing MapReduce jobs to solve your problem. Spark, however, removes much of that headache: it supports many machine learning algorithms out of the box, and they are easy to use. I would therefore advise you to go with Spark. You can see Spark in action by visiting this link:
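To show why hand-written MapReduce code gets tedious, here is a minimal pure-Python sketch of the map, shuffle, and reduce phases of the classic word-count example. All names here are illustrative; a real Hadoop job would be written against the Java MapReduce API and run across a cluster.

```python
from collections import defaultdict

def map_phase(line):
    # Emit (word, 1) pairs, like a Hadoop Mapper.
    return [(word, 1) for word in line.split()]

def shuffle_phase(pairs):
    # Group values by key, like the framework's shuffle/sort step.
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_phase(grouped):
    # Sum the counts for each word, like a Hadoop Reducer.
    return {word: sum(counts) for word, counts in grouped.items()}

lines = ["big data tools", "big data frameworks"]
pairs = [pair for line in lines for pair in map_phase(line)]
counts = reduce_phase(shuffle_phase(pairs))
print(counts)  # {'big': 2, 'data': 2, 'tools': 1, 'frameworks': 1}
```

Even this toy version needs three separate functions for one simple count; with Spark the same computation is a short chain of high-level operations, which is why I recommend it.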
As I said, it all depends on the problem you are trying to solve and your facility with the tools. You can also use ECL-ML (https://hpccsystems.com/download/free-modules/ecl-ml) to process large volumes of data. Also, do not forget that WEKA, the popular machine learning tool, supports distributed data mining, so you can run many of its algorithms in distributed mode (http://weka.sourceforge.net/packageMetaData/distributedWekaHadoop/index.html).
I think you can use Apache Hadoop. The essential problem in big data is how to distribute the computation, and Apache Hadoop is the right framework for that.