I am working on T cell receptor repertoire analysis obtained from RNA seq data. I am trying to understand the CDR3 length and amino acid usage in my data set, where I have variable lengths of amino acids. My query is whether it is right to remove the first three and last three amino acids (due to their conservative nature and less involvement in antigen binding). If not, could anyone let me know how to proceed with the same?
Thank you