Hi folks, I'm a computer scientist PhD student, and I'm working on implementing Multi-Task Learning architecture for a better generalization aims, it will be throughout a Deep Learning model. I have some questions concerning MTL algorithms and its feasibility, for those whom already worked on the same project, here are my questions:
1- Can we design an MTL architecture model based on different task's definition ? Example: task 01: is a classification, task 02: is a clustering (mixing between supervised and unsupervised tasks) is it possible, or we have to design a common and homogeneous architecture ?
2- Is it a mandatory to assign a specific dataset for each task ? Or, we can use a common and global dataset for both shared layers and tasks specific layers (example: an ecommerce historical purchase) ?
3- According to you, what are the best pretraining MTL architecture models that I could rely on ? Thanks in advance !