Multi-Task Learning Architecture for Inductive Learning ability ?

22 May 2024 1 9K Report

Hi folks, I'm a computer scientist PhD student, and I'm working on implementing Multi-Task Learning architecture for a better generalization aims, it will be throughout a Deep Learning model. I have some questions concerning MTL algorithms and its feasibility, for those whom already worked on the same project, here are my questions:

1- Can we design an MTL architecture model based on different task's definition ? Example: task 01: is a classification, task 02: is a clustering (mixing between supervised and unsupervised tasks) is it possible, or we have to design a common and homogeneous architecture ?

2- Is it a mandatory to assign a specific dataset for each task ? Or, we can use a common and global dataset for both shared layers and tasks specific layers (example: an ecommerce historical purchase) ?

3- According to you, what are the best pretraining MTL architecture models that I could rely on ? Thanks in advance !

Md Shohanur Rahman

1. Combining Different Task Types

Yes, you can design an MTL architecture that combines different tasks, like classification (supervised) and clustering (unsupervised). Use shared layers to learn common features, followed by task-specific layers for each task type.

Example Architecture:

Input Layer
Shared Layers (e.g., convolutional or dense)
Task-Specific Branches: (Classification: Dense layers → Softmax, Clustering: Dense layers → Embedding output)

2. Dataset Allocation

You can use a common dataset for both shared layers and task-specific layers, especially if the tasks are related.

Approach:

Shared Dataset: Useful for tasks that can benefit from shared representations.
Task-Specific Augmentation: Apply specific preprocessing if needed.

3. Recommended Pretrained MTL Models:

BERT: Great for NLP tasks, with shared transformer layers and task-specific heads.
MT-DNN: An extension of BERT, designed for multiple NLP tasks.
Multi-Task CNNs: Shared convolutional layers with task-specific fully connected layers for image tasks.
U-Net: Shared encoder with multiple decoder heads for tasks like segmentation and classification.

Pretraining Approach:

Transfer Learning: Fine-tune a pretrained model on your specific tasks.
Joint Training: Train on multiple tasks from the start to learn generalized features.
These approaches will help you effectively implement a robust MTL architecture.

Badges
Science topic

More Nassim Lateb's questions See All

How to define a step in the design interval using an optimization algorithm?

Hello experts This question is shared with one of my research team. We are dealing with an optimization problem in which the algorithm will choose the cross-section of the column (RC structure)...

26 March 2023 6,328 4 View

Comparing Mike 11 and SWAT?

I need a water quality-quantity model for improving the water quality of the Amirkabir dam, and I don’t know considering the limitations and advantages of both models: SWAT and MIKE 11 which one...

03 May 2022 4,293 6 View

Fitch Connect Database ?

Is there any one on the network who can provide access to the Fitch Connect Database regarding the banking metrics ?

22 April 2020 4,510 0 View

Why value "S" and "U" (shell parameters) is limited to 4 ?

Hi evrybody, when wa calculate stress du to radial local load (Pr), and/or moment (M), on a spherical sherical shell or head Value S to find stresses at distance x from centerline in the...

04 March 2020 3,629 1 View

What is the thermal behaviour of SiGe HBTs?

My name is nassim aliouche, I am a final year student in microtechnology at Marne la Vallée University , i am working on a presentation about the bipolar Transistor SiGe So , for For SiGe HBTs...

11 December 2019 6,578 1 View

Are there any vehicle models or tool boxes for testing fuel (pulse width) and ignition maps?

I'm currently doing a Master's project for mapping an ECU and was wondering whether I'd be able test those maps, using a model, as though I was running the vehicle on a dyno, or even testing the...

26 March 2019 5,704 1 View

How to protect the transmitted information by channel coding?

I have secondary informatins to protected and must be coded in the transmission channel. I'm looking for ideas or matlab codes that explain how to encode this informations.

25 March 2019 4,601 6 View

Can we write a function to define upper and lower bounds of the genetic algorithm?

Dear researchers I need to write a script to define the upper and the lower bounds for a genetic optimization using functions scripts. Each function (for Upper and Low bound) we be handled in...

10 February 2019 2,596 7 View

In a researcher proposal, should we outline contribution to theory throughout the literature review or separately in the conclusion of the intent?

To explain contribution to theory of a research study in a research intent, should we do it all along the document or in the conclusion?

06 January 2019 339 5 View

How can I define a materials parameter according to time under comsol 5.0

How can I define a materials parameter according to time under comsol 5.0 because when i try to solve my modele i have warning message like this Variable non définie. - Variable: t - Géométrie:...

18 May 2017 3,547 1 View

Feedback defines the constitution of an organism?

“Here is a thought experiment. Let's place Rodolpho Llinas's jarred-brain on top of a body (Fig. 1). I bet Llinas would argue that his jarred-brain retains its own consciousness, and the android...

11 August 2024 2,483 1 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Baseline drift in HPLC? What causes this?

Hello, Why do i see this baseline drift when i compare my blank (black) to the sample (blue)? Any suggestions as to why this happened? Thank you!

11 August 2024 3,770 4 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

09 August 2024 7,718 0 View

Self-Organizing Superorganisms—as envisaged by Nenad Sestan (2018)?

The rate of glucose consumption by the neocortex is reduced by over 80% during anesthesia (Sibson et al. 1998), which disables the synapses (Richards 2002) that are inundated by glial tissue (Engl...

08 August 2024 3,118 0 View

How are iso-frequency contours plotted?

Let's say we have a standard, regular hexagonal honeycomb with a 3-arm primitive unit cell (something like the figure attached; the figure is only representative and not drawn to scale). The...

07 August 2024 1,937 1 View

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?

A fungal strain was treated with nanoparticles. We want to do an environmental SEM analysis. So could anyone share your views on preparing the sample? Thank you.

07 August 2024 5,307 1 View