Input:
- A normalized RNA-Seq expression dataset with Ensembl gene IDs as row names.
Problem:
- Multiple Ensembl gene IDs map to the same gene symbol, so the same gene symbol ends up with multiple expression values.
Question:
- How should I handle these multiple mappings in the dataset? Should I sum the expression values of the different IDs, take their mean, or do something else?
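
To make the two options concrete, here is a minimal sketch in plain Python on made-up values. The IDs, gene symbols, numbers, and the helper `collapse_by_symbol` are all hypothetical and only illustrate what "sum" versus "mean" would do to rows that share a symbol; it does not say which choice is appropriate for a given normalization.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical toy data: gene ID -> normalized expression per sample.
expression = {
    "ENSG_A": [2.0, 4.0],
    "ENSG_B": [6.0, 8.0],
    "ENSG_C": [1.0, 3.0],
}

# Hypothetical mapping: two IDs share the symbol GENE1.
id_to_symbol = {"ENSG_A": "GENE1", "ENSG_B": "GENE1", "ENSG_C": "GENE2"}


def collapse_by_symbol(expression, id_to_symbol, combine):
    """Merge rows that map to the same symbol, combining values per sample."""
    grouped = defaultdict(list)
    for gene_id, values in expression.items():
        grouped[id_to_symbol[gene_id]].append(values)
    # zip(*rows) walks the grouped rows sample by sample.
    return {
        symbol: [combine(column) for column in zip(*rows)]
        for symbol, rows in grouped.items()
    }


summed = collapse_by_symbol(expression, id_to_symbol, sum)
averaged = collapse_by_symbol(expression, id_to_symbol, mean)

print(summed)    # GENE1 rows are added per sample
print(averaged)  # GENE1 rows are averaged per sample
```

Passing a different `combine` function (for example `max`) would give a third variant, such as keeping the highest value per sample.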