If we want to define a notion of size for a concept, what is the best criterion for doing so?
Is it possible to use the greedy set covering algorithm and take the minimum number of spheres needed to cover the positive instances as a measurement of concept complexity?
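For concreteness, here is a rough sketch (in Python/NumPy) of what such a measure could look like; the sphere radius r is an assumed hyperparameter, and this is only an illustration of the idea, not a worked-out proposal:

```python
# Sketch: count how many fixed-radius spheres, centred on positive training
# points, a greedy set cover needs to cover all positives.
# The radius r is an assumed hyperparameter of the measure.
import numpy as np

def greedy_sphere_cover_size(X_pos, r):
    """Greedy set cover: repeatedly pick the sphere (centre = a positive
    instance, radius = r) that covers the most still-uncovered positives.
    Returns the number of spheres used, a candidate 'concept size'."""
    n = len(X_pos)
    # cover[i, j] is True if the sphere centred at X_pos[i] covers X_pos[j]
    dists = np.linalg.norm(X_pos[:, None, :] - X_pos[None, :, :], axis=-1)
    cover = dists <= r
    uncovered = np.ones(n, dtype=bool)
    n_spheres = 0
    while uncovered.any():
        # pick the centre that covers the most still-uncovered points
        gains = (cover & uncovered).sum(axis=1)
        best = int(np.argmax(gains))
        uncovered &= ~cover[best]
        n_spheres += 1
    return n_spheres

# Example: two well-separated clusters of positives need about 2 spheres
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 0.1, (20, 2)), rng.normal(5, 0.1, (20, 2))])
print(greedy_sphere_cover_size(X, r=1.0))  # -> 2
```

Note that this greedy count is only an approximation of the true minimum set cover (which is NP-hard to compute exactly), and the resulting "size" clearly depends on the choice of r and of the distance metric.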
Your idea reminds me of Kolmogorov complexity: the complexity of a string is the length of the shortest program that outputs it. Yet spheres are highly regular. In an abstract sense, you are trying to generalize Kolmogorov's idea to a space by merging adjacent regions. But adjacency depends on how you define the geometry of that space. Many regularization approaches in machine learning therefore calibrate concept complexity by cross-validation.
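As a small illustration of what "calibrating complexity by cross-validation" can mean in practice, one can sweep a complexity knob of a model and keep the value that cross-validates best. The dataset and model below are arbitrary placeholders, not a prescription:

```python
# Pick the decision-tree depth (a complexity knob) by 5-fold cross-validation
# rather than fixing it a priori. Dataset and model are illustrative only.
from sklearn.datasets import make_moons
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = make_moons(n_samples=300, noise=0.25, random_state=0)

scores = {}
for depth in range(1, 11):
    clf = DecisionTreeClassifier(max_depth=depth, random_state=0)
    scores[depth] = cross_val_score(clf, X, y, cv=5).mean()

best_depth = max(scores, key=scores.get)
print(f"cross-validated best depth: {best_depth}")
```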
Thank you, Matthias. The question is: is this sphere set covering method influenced by the size of the training set, or does it remain stable as we increase the number of training examples for a given concept?
You should consider that the complexity of a concept is related to the complexity of the domain to which it belongs. The same machine learning algorithm behaves differently depending on the complexity of the domain in which it is intended to learn concepts.
Interesting question. But can you give some examples of the concepts you want to learn, and how those concepts are represented in your input space? Concepts can be understood in different ways...
Thanks Ramon, but could you provide a little more information? For example, some references about domain complexity and the relation between domain complexity and concept complexity. I would appreciate it.
Dear Xavier, this is a general question. I don't have any specific dataset in hand to work with right now. Actually, I am trying to find a general idea in order to develop a purely computational theory. That is, suppose we want to work with 1000 different datasets representing 1000 different concepts in different scopes. The question is whether there is a single criterion or method for measuring the complexity of these concepts.
Dear Pouya, I understand that you want a general measure. But I do not understand what you have in mind when you talk about a concept. Is it like a category, a cluster, an analogy, or something else? Are your concepts independent (i.e. is the intersection of all concepts empty)?
Xavier, I'm not sure exactly what you mean, but let me describe it this way using a classical example: suppose we want to train a machine to recognize whether the conditions are good for playing tennis or not, which depends on many weather factors; "playing tennis", together with all the relevant factors, is the concept we want to teach the machine. In the other case, suppose a machine must determine whether a cat is in an image or not; depending on the dataset provided, the attributes differ in quality and quantity. Here, "a cat being in an image", with all its presented attributes, is the concept.
Pouya, I think I see better what you mean now. Are you familiar with V. Vapnik's theories of learning? You may find some hints for your question there. But, from my point of view, the hardness of learning a concept is highly dependent on the data you have! If all the cats are in bright images, and all the images without cats are dark, then it is an easy task that a simple perceptron can do.
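To make that last point concrete, here is a toy sketch (synthetic stand-in images, not a real dataset): when all "cat" images are bright and all "no cat" images are dark, a single feature such as mean brightness makes the concept linearly separable, and a bare perceptron learns it perfectly.

```python
# Toy illustration: the data distribution, not the concept itself,
# makes this classification task trivial for a perceptron.
import numpy as np
from sklearn.linear_model import Perceptron

rng = np.random.default_rng(0)
bright = rng.uniform(0.7, 1.0, size=(100, 64))  # "cat" images: high pixel values
dark = rng.uniform(0.0, 0.3, size=(100, 64))    # "no cat" images: low pixel values

# One feature per image: mean pixel brightness
X = np.vstack([bright, dark]).mean(axis=1, keepdims=True)
y = np.array([1] * 100 + [0] * 100)

clf = Perceptron().fit(X, y)
print("training accuracy:", clf.score(X, y))  # 1.0 -- the data made the task easy
```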