Skip to content

Feature vector and Data distribution in Training, Validation and Development. #1

@Rajratnpranesh

Description

@Rajratnpranesh

Thank you for open sourcing your work. I am working on an implementation of GNN on protein PDB files. What I am looking for is the feature vector of each protein generated before the input in the dense layer of GNN. I am able to save the files in a pickle file. But the issue is if I put 20 PDB data then it shows 39 training samples,3 validation samples and 4 test samples. I can't figure out why and how this is happening as split = 0.7, 0.1, 0.2.
An explanation would be very helpful. Also if you can suggest me a way to save the feature vector of each protein generated before the input in the dense layer of GNN. I need to use the feature vector for further analysis.
Thank you for your time and consideration. I will wait for your response.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions