ProSECFPs

ProSECFPs is a tool for generating representations of protein sequences. This repository provides a Python script for computing ProSECFP descriptors using a predefined environment with all required dependencies.

Video explanation 🎥

An overview lecture on ProSECFPs and related concepts, delivered by Prof. Villoutreix, is available here: https://www.youtube.com/watch?v=K3wT_33e2Ao

Citation 📄

If you use ProSECFPs in your research, please cite the following paper: ProSECFPs: A Novel Fingerprint-Based Protein Representation Method for Missense Mutation Pathogenicity Prediction Clarissa Poles, Miriana Di Stefano, Lisa Piazza, Giulia Bononi, Giulio Poli, Marco Macchia, Tiziano Tuccinardi, and Antonio Giordano Journal of Chemical Information and Modeling 2025 65 (24), 13478-13492 DOI: 10.1021/acs.jcim.5c02437

Installation

To set up the required environment, use the provided YAML file:

conda env create -f prosecfps_env.yml Then, activate the environment:

conda activate prosecfps_env

Usage

Run the prediction script with Python:

python ProSECFPs.py -in input.csv -out output.csv -nj 1

Input

The input file must be a CSV containing a column named Sequences, which represents the protein sequences. A sample CSV file (input.csv) is included in the repository for testing purposes, along with the amino acid descriptor dataset (descriptors.dump) required for computing the ProSECFP representations.

Output

The script generates a CSV file containing the ProSECFP descriptors for each protein sequence. The descriptors are computed using the C-PSECFP variant, with an iteration radius of 12 and a representation vector length of 1024.

Dependencies

All necessary dependencies are included in prosecfps_env.yml.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ProSECFPs

Video explanation 🎥

Citation 📄

Installation

Usage

Input

Output

Dependencies

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
ProSECFPs.py		ProSECFPs.py
README.md		README.md
descriptors.dump		descriptors.dump
input.csv		input.csv
prosecfps_env.yml		prosecfps_env.yml

MMVSL/ProSECFPs

Folders and files

Latest commit

History

Repository files navigation

ProSECFPs

Video explanation 🎥

Citation 📄

Installation

Usage

Input

Output

Dependencies

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages