Feature vector and Data distribution in Training, Validation and Development.

Thank you for open sourcing your work. I am working on an implementation of GNN on protein PDB files. What I am looking for is the feature vector of each protein generated before the input in the dense layer of GNN. I am able to save the files in a pickle file. But the issue is if I put 20 PDB data then it shows 39 training samples,3 validation samples and 4 test samples. I can't figure out why and how this is happening as split = 0.7, 0.1, 0.2. 
An explanation would be very helpful. Also if you can suggest me a way to save the feature vector of each protein generated before the input in the dense layer of GNN. I need to use the feature vector for further analysis.
Thank you for your time and consideration. I will wait for your response.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature vector and Data distribution in Training, Validation and Development. #1

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Feature vector and Data distribution in Training, Validation and Development. #1

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions