How to evaluate a whole cognitive architecture/agents?

More Samer Schaat's questions See All

Will Artificial Intelligence replace human intuition and creativity?

Considering Artificial Intelligence as a double-edged sword in human life.

10 November 2023 3,104 15 View

What is the role of artificial intelligence in sports training?

Does artificial intelligence play a role in the process of preparing athletes at a high level?

07 November 2023 2,609 5 View

In social science, scientific realism states that theoretical, unobservable entities are ontologically real; is critical realism the same?

Epistemologically, scientific realism is (say, mainly) based on the idea that we generate theoretical entities to understand, describe, and discover a structure in the reality; these entities are...

01 February 2023 254 4 View

Could you please telling me which type of Acetobacter produce high amount of acetic acid to be used in large scale?

I want to produce acetic acid in large amounts, so I'm looking for Acetobacter that give high percentage of Acetic acid,.

27 September 2022 7,730 0 View

How to generate 180 degree phase-shift between two PWM signals using TMS320f28377d DSP module?

Hi everyone, I am using a TMS320f28377d DSP module for my system (multilevel inverter) which requires generating 24 PWM signals. The good thing in this DSP is that it can generate 24 PWM signals....

02 October 2021 7,610 6 View

How can sign language be used for deaf people with moderate intellectual disabilities?

There is great difficulty in translating sign language for deaf people with moderate intellectual disabilities. I believe that there are other methods that facilitate sign language communication...

16 March 2021 9,457 3 View

How to make a topical application?

greetings I want to ask about the way of making topical application of a medicinal product that not used topically before, how we can formulate it as a topical dosage form? your assistance is...

16 November 2020 7,208 5 View

How can i simulate the lightning to the ground with time domain?

my project is to simulate lightning that strikes wind turbine. the terminal is cloud and wind turbine is ground. i want to use time domain and see how the discharge of lightning travels to ground.

26 June 2020 6,613 1 View

Flow distribution in prmary surface heat exchanger?

Hello, I am familiar with plate fin heat exchangers, however when it comes to primary surface heat exchangers 5 shown in the figure) I need to know how the flow is distributed from the header to...

23 June 2020 2,531 0 View

What is the full script that I can use to extract sequences from fasa file by using sequenc IDs in text file using python 3 in pycharm environment ?

I want to extract sequences from fasta file using a text file witj ids in python

29 May 2020 1,889 3 View

Could you recommend some articles on Urban Transportation System optimization and Innovation?

13 August 2024 2,595 3 View

Feedback defines the constitution of an organism?

“Here is a thought experiment. Let's place Rodolpho Llinas's jarred-brain on top of a body (Fig. 1). I bet Llinas would argue that his jarred-brain retains its own consciousness, and the android...

11 August 2024 2,483 1 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Which Scopus Journal provides the most affordable fees?

"PUBLISHING IN A SCOPUS JOURNAL" Researchers are now at a cross road. The critical need to publish in a Scopus or ISI, etc journal is ever vital. Journal Publication fees must be submitted....

10 August 2024 8,621 1 View

Seeking Advice on Viability and Execution of Undergraduate Thesis Topic?

Hello everyone, I am currently developing a thesis proposal and would appreciate your input on its viability and how to effectively carry it out. My proposed topic is: "Does the perceived threat...

10 August 2024 8,992 0 View

Can we mark 'EFL Learners shifting from general digital to AI technologies' as technological transition?

After COVID-19 it has seen that EFL learners technological affiliation has raised. In addition, in the post-COVID period learners started to engage AI technologies like ChatGPT while learning...

08 August 2024 8,964 4 View

Who will be moral responsible for the death of thousands of people in the event of an earthquake?

Who will bear moral responsibility for the deaths of thousands of people in the event of an earthquake? Weeks and months remain before the onset of strong earthquakes that bring death to...

08 August 2024 6,134 12 View

What are examples of AI for good projects a teacher can assign to students?

So I am organizing an AI seminar. What are possible AI projects in the AI for good spirit? something the students can do and have an impact?

08 August 2024 9,437 4 View

Self-Organizing Superorganisms—as envisaged by Nenad Sestan (2018)?

The rate of glucose consumption by the neocortex is reduced by over 80% during anesthesia (Sibson et al. 1998), which disables the synapses (Richards 2002) that are inundated by glial tissue (Engl...

08 August 2024 3,118 0 View

How to design human-centered classroom in the age of A.I.?

08 August 2024 347 5 View

Jan Tünnermann

I suggest that such a system can only be evaluated with a concrete task that allows to quantify success. This also involves considering the embodiment of the system. So in that sense the competitions you mention seem useful. However, typically you would want your cognitive architecture to be as general as possible and not limited to one specific task; this should be considered setting up a set of evaluation tasks. Comparison to empirical results from humans seems useful if you either (1) want to mimic their behavior (e.g. in some HCI situation) or (2) if you explicitly want to model human cognitive system (i.e. to make predictions how a person would act in some task). The task and embodiment should be kept in mind. There is no use e.g. to compare human gaze-behavior with that of a proposed 6-armed and 3-eyed robot that catches flies out of the air --- nor to judge a human in fly catching task.

I agree that the question of evaluating such systems is not sufficiently addressed yet. Often the definition of a target system and task is missing and comparisons are made with ground-truth from human subjects whose behavior is necessarily influences by several cognize states that are not part of the evaluated system (e.g. influence of knowledge). I think it would be an advancement if hierarchical classifications* would be created for such systems with tasks and metrics provided and assigned to the classes. A system can perform different tasks of the lower hierarchy levels and it can be compared with specialized solutions on the lower levels and with other systems on the same level.

*e.g.: System-that-can-do-everything

Samer Schaat

Thanks Jan. What do you suggest as evaluation tasks? Is the definition of use and test cases with pre and post-conditions etc (as in software engineering) and testing the system (in a simulation) if it fulfills the requirements a appropriate way to evaluate it? E.g. is an agent able to survive in an artificial life simulation.

I am afraid that such a method of evaluation would lead to task models rather than process models, which should be approached for a cognitive architecture.

Russell C. Thomas

One way to evaluate cognitive agent architectures is to identify their capabilities, and compare these to both theory and empirical research in cognitive and behavioral psychology. This can help answer questions regarding HOW agents will accomplish certain tasks in certain environments. Of course, it doesn't answer how well they will perform. For example, if the agent architecture is focused on Belief-Intention-Action (BIA) reasoning using static production rules, it would be inappropriate to use it in experiments where you are evaluating agent creativity (formulating and using novel concepts).

Leonid Perlovsky

Dear Samer and Russell,

I agree with Russell,

The best cognitive agent is human brain, and the best approach is to study brain-mind and model it. Of course BIA is not a valid model of the mind. I do not think it is even worth mentioning.

John David Sanders

The Agent paradigm is only an idea for a cognitive model and as stated by Leonid there is only one valid example - and we don't have a model for that yet.

Imagine that all of your cognitive aqents have a shared language (assumption: they all share the same environment) if you can get them to produce plausible discourse about their environment and it is understood and used by the others then you getting somewhere. Note discourse production is not the same a pre-planned/defined communications.

To solve this problem you will need to make their environment very rich and totally consistent. An then you need in internalised model of that environment within your agent.

Note: this problem has nothing to do with the Turing test - the key is the degree of coupling between agent and the environment and its internalisation as a model.

Thank you John Sanders for clarifying this. My response referred to rather concrete cognitive systems (probably not what Samer Schaat meant) but they do share the problem of evaluability to some degree. I'm not familiar with he idea of the agent paradigm but wouldn't you say---when you move from the idealized model to practical systems---that the concrete tasks, environments and embodiment are of crucial importance for the evaluation (which must not necessarily agree with those of humans)? Even if a universal model is the goal still progress must be made measurable somehow.

Dear John and Jan,

I agree. I would add that the best evaluation is practical use in the intended environment. It might also be easier than trying to simulate and adequate test.