MLE-26652 Added docs for additional Flux API dependencies #578

Merged
rjrudin merged 1 commit into develop from feature/2.0-docs-update
Jan 21, 2026
@rjrudin rjrudin commented Jan 21, 2026

Applying this to the develop branch as the Tika version is slightly different.
Copilot AI review requested due to automatic review settings January 21, 2026 18:45
Copilot AI left a comment

Pull request overview

This PR adds documentation for additional Apache Tika dependencies required when using Flux's text extraction capabilities.

Changes:

  • Added a new "Additional dependencies" section documenting Apache Tika library requirements
  • Specified Tika module dependencies for Microsoft Office and PDF file text extraction
  • Included reference to additional supported formats

### Additional dependencies

If you wish to use Flux's text extraction capabilities, you will need to include the following Apache Tika libraries
as dependencies in your project:
Copilot AI Jan 21, 2026

Missing period at the end of the sentence.

Suggested change
- as dependencies in your project:
+ as dependencies in your project.

- For extracting text from Microsoft Office files, include `org.apache.tika:tika-parser-microsoft-module:3.2.3`.
- For extracting text from PDF files, include `org.apache.tika:tika-parser-pdf-module:3.2.3`.
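
As a rough illustration of the two bullets above, a Gradle build using the Kotlin DSL might declare the modules as follows. This is only a sketch: the documentation under review does not say which build tool is used, so the choice of Gradle, the `build.gradle.kts` file name, and the `implementation` configuration are all assumptions. Only the two dependency coordinates come from the text above.

```kotlin
// build.gradle.kts (fragment; assumes a Gradle project using the Kotlin DSL)
dependencies {
    // Needed for extracting text from Microsoft Office files.
    implementation("org.apache.tika:tika-parser-microsoft-module:3.2.3")

    // Needed for extracting text from PDF files.
    implementation("org.apache.tika:tika-parser-pdf-module:3.2.3")
}
```

A project that extracts text from only one of the two file types would presumably need only the matching line.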

Apache Tika supports [additional formats](https://tika.apache.org/3.2.3/formats.html)
Copilot AI Jan 21, 2026

The hardcoded version '3.2.3' in the URL may become outdated when Tika is upgraded. Consider using a version-agnostic URL or adding a note that the link points to version-specific documentation.

Suggested change
- Apache Tika supports [additional formats](https://tika.apache.org/3.2.3/formats.html)
+ Apache Tika supports [additional formats](https://tika.apache.org/formats.html)

@rjrudin rjrudin merged commit 9d5c191 into develop Jan 21, 2026
3 checks passed
@rjrudin rjrudin deleted the feature/2.0-docs-update branch January 21, 2026 19:05