txt2phrases — Feature Enhancement Proposal

# txt2phrases — Feature Enhancement Proposal

Enhance `txt2phrases` to support more flexible input handling and compatibility with research workflows such as **pygetpapers**.  

This update will make the library capable of automatically processing research papers in varied directory structures, converting PDFs to text, and allowing both single-file and batch-folder input.

---

## Proposed Enhancements

### 1. pygetpapers Output Compatibility
- **Goal**: Enable `txt2phrases` to automatically detect and process the directory structure generated by `pygetpapers`.  
- **Why**: The current structure of `pygetpapers` outputs differs from standard input formats expected by `txt2phrases`.  
- **Expected Behavior**:  
  `txt2phrases` should intelligently navigate nested folders to find and process `.pdf` or `.txt` files.  


---

### 2. PDF → TXT Conversion Method
- **Goal**: Add a built-in method to convert `.pdf` files into `.txt` for downstream keyword extraction.  
- **Why**: Users should be able to directly process PDF research papers without manual text extraction.  

---

### 3. File and Folder Input Support

- **Goal**: Allow `txt2phrases` to work seamlessly with both single files and entire directories.  

- **Why**: This provides flexibility for users who want to analyze one document or batch-process an entire dataset.  


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

txt2phrases — Feature Enhancement Proposal #6

txt2phrases — Feature Enhancement Proposal

Proposed Enhancements

1. pygetpapers Output Compatibility

2. PDF → TXT Conversion Method

3. File and Folder Input Support

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

txt2phrases — Feature Enhancement Proposal #6

Description

txt2phrases — Feature Enhancement Proposal

Proposed Enhancements

1. pygetpapers Output Compatibility

2. PDF → TXT Conversion Method

3. File and Folder Input Support

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions