Malware Document Scanner is a simple and effective C# tool that helps scan documents (like .pptx, .docx, and .pdf) for potential malware or suspicious content. The tool identifies common malware markers such as embedded macros, suspicious keywords, and strange file structures.
Whether you're a security enthusiast or a developer looking to secure documents, this tool can help you detect malware within your files and ensure your documents are clean and safe.
Suspicious Keyword Detection: Flags documents containing keywords like powershell, cmd.exe, and other common malware terms.
Macro/Embedded Object Detection: Scans for macros or embedded objects in .pptx and .docx files, which are often used for malware execution.
File Structure Anomalies: Looks for unusual file sizes or structural patterns that may indicate tampering or malware.
Simple and User-Friendly: Just input the file path, and the tool will analyze it and give you the result.
- Clone the repository To get started, clone the repo to your local machine:
git clone https://github.com/dionabazi/Malware-Document-Scanner.git 2. Build the project Open the solution in Visual Studio.
Build the project (Ctrl+Shift+B).
Run the project directly from Visual Studio.
Run the program.
Enter the full file path of the document you want to scan (it supports .pptx, .docx, and .pdf).
The tool will:
Scan the document for common malware indicators.
Output whether the document is clean or if it may be malicious.
π Scanning files for malware... π
Enter the file path to scan: C:\Users\user\Documents\example.pptx
Scanning file...
#
The Malware Document Scanner uses the following detection techniques:
Suspicious Keywords: Checks the content for dangerous keywords like powershell, cmd.exe, macro, etc.
Macros/Embedded Objects: Scans for embedded objects or macros that can potentially trigger malicious actions in documents.
File Structure: Flags documents with unusual file sizes or structural anomalies which are common indicators of tampered files.
C# β The primary programming language used.
File I/O β For reading and scanning .pptx, .docx, and .pdf files.
Regex β For matching suspicious patterns within file content.
Feel free to open issues and contribute to the project. If you find any bugs or want to add features, please create a pull request. All contributions are welcome!
This project is licensed under the MIT License β see the LICENSE file for details.
