Thank you for open sourcing your work. I am working on an implementation of GNN on protein PDB files. What I am looking for is the feature vector of each protein generated before the input in the dense layer of GNN. I am able to save the files in a pickle file. But the issue is if I put 20 PDB data then it shows 39 training samples,3 validation samples and 4 test samples. I can't figure out why and how this is happening as split = 0.7, 0.1, 0.2.
An explanation would be very helpful. Also if you can suggest me a way to save the feature vector of each protein generated before the input in the dense layer of GNN. I need to use the feature vector for further analysis.
Thank you for your time and consideration. I will wait for your response.