PROtein FEATure eXtractor

About ProFeatX

Most machine learning methods require large amounts of data, with every data point represented by a vector of the same size. Proteins, however, are amino acid sequences of variable length, so a fixed number of features must be extracted from every protein before it can be used as input. There are numerous methods to achieve this, but only a few tools let researchers encode their proteins with multiple methods without having to switch between programs or, in many cases, implement the algorithms themselves or even devise new ones. The ProFeatX server is a web tool that provides 32 descriptors for extracting protein features, offers hybrid encodings and protein-protein interaction (PPI) support, and accepts DNA/RNA sequences. The standalone version of ProFeatX offers 50 descriptors and lets the user encode large files (>50MB).
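
To illustrate what turning a variable-length sequence into a fixed-size vector means, here is a minimal Python sketch of amino acid composition, a widely used descriptor that records the fraction of each of the 20 standard amino acids. This is not ProFeatX code and does not reflect its interface; the function name, the residue ordering and the example sequences are assumptions made only for this illustration.

```python
from collections import Counter

# The 20 standard amino acids, in an arbitrary but fixed order (assumed here).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def amino_acid_composition(sequence):
    """Return a 20-element list with the fraction of each standard residue.

    Hypothetical helper for illustration; non-standard characters are ignored.
    """
    counts = Counter(ch for ch in sequence.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        # No standard residues found: return an all-zero vector.
        return [0.0] * len(AMINO_ACIDS)
    return [counts[aa] / total for aa in AMINO_ACIDS]


# Two made-up sequences of different lengths yield vectors of the same size.
short_vec = amino_acid_composition("MKT")
long_vec = amino_acid_composition("MKTAYIAKQRQISFVKSHFSRQ")
print(len(short_vec), len(long_vec))
```

Both vectors have 20 entries regardless of sequence length, which is the property a downstream model needs. Composition discards residue order, which is one reason tools typically offer many descriptors rather than a single one.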