Add PDF syntax to Rouge by petervwyatt · Pull Request #2058 · rouge-ruby/rouge

petervwyatt · 2024-07-02T06:21:24Z

Please accept this lexer for PDF syntax (a.k.a. "COS syntax").

PDF (Portable Document Format) is an object-based, declarative page description language that, in practice, is a random-access binary (non-text) format. It is formally defined by ISO 32000-2:2020 as corrected by errata (please do not refer to outdated legacy Adobe documentation!). However, with care, text-centric PDFs (whole files or fragments) can be created, such as those used in documentation. This token-based, forward-only lexer is not intended for binary real-world PDFs, because that is not how real PDFs need to be lexed (and doing so will likely raise Ruby UTF-8 errors anyway!).
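For readers unfamiliar with what "token-based, forward lexing" means for text-centric PDF fragments, here is a minimal sketch. It is not the lexer in this PR and does not use the Rouge API: it uses Ruby's standard `StringScanner`, and the rule table `COS_RULES`, the token names, and the method `lex_cos` are all hypothetical. It covers only a small subset of COS syntax (for example, it ignores nested parentheses in literal strings and does not handle streams).

```ruby
require 'strscan'

# Hypothetical token rules for a small subset of PDF (COS) syntax.
# Order matters: the first pattern that matches at the cursor wins.
COS_RULES = [
  [:comment,    /%[^\r\n]*/],
  [:whitespace, /[\x00\t\n\f\r ]+/],
  [:name,       /\/[^\s()<>\[\]{}\/%]*/],
  [:string,     /\((?:\\.|[^\\()])*\)/], # simplification: no nested parentheses
  [:number,     /[+-]?(?:\d+\.?\d*|\.\d+)/],
  [:dict_open,  /<</],
  [:dict_close, />>/],
  [:hex_string, /<[0-9A-Fa-f\s]*>/],
  [:array,      /[\[\]]/],
  [:keyword,    /[A-Za-z][A-Za-z0-9*']*/],
].freeze

# Scan strictly forward through the text, emitting [type, text] pairs.
# Any character that no rule recognizes becomes a one-character :error token.
def lex_cos(text)
  scanner = StringScanner.new(text)
  tokens = []
  until scanner.eos?
    type, = COS_RULES.find { |_, pattern| scanner.scan(pattern) }
    if type
      tokens << [type, scanner.matched]
    else
      tokens << [:error, scanner.getch]
    end
  end
  tokens
end

sample = "1 0 obj\n<< /Type /Catalog >>\nendobj\n"
lex_cos(sample).each do |type, text|
  puts "#{type}: #{text.inspect}" unless type == :whitespace
end
```

A forward scan like this never follows byte offsets or jumps to the cross-reference table, which is why it suits documentation fragments but not real-world binary PDFs.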

We wish to leverage this Rouge PDF lexer upstream in current and future PDF ISO standards and specifications, which are authored in AsciiDoc via Metanorma, to highlight the many code fragment examples in that documentation.

Needs to be treated as binary for xref to remain valid

petervwyatt · 2024-07-02T06:30:06Z

BTW the PDFs added are fully functional PDFs that will work in products such as Adobe Acrobat Reader; they likely just need to be renamed with a .pdf extension. Also be careful with EOL conversions out of GitHub, as PDFs are binary files! Normally I'd control this with a .gitattributes file containing *.pdf binary, but because Rouge's demo and sample files don't use file extensions this isn't possible.
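To illustrate why EOL conversion matters, the following sketch shows how object byte offsets, which a PDF cross-reference (xref) table records, shift when LF line endings are rewritten as CRLF. The sample body and the helper `object_offsets` are hypothetical and are not taken from the PDFs in this PR.

```ruby
# Record the byte offset at which each "N G obj" line starts,
# which is the kind of value an xref table stores for each object.
def object_offsets(bytes)
  offsets = {}
  pos = 0
  bytes.each_line do |line|
    if (m = line.match(/\A(\d+) (\d+) obj\b/))
      offsets[m[1].to_i] = pos
    end
    pos += line.bytesize
  end
  offsets
end

# A tiny, hypothetical PDF-like body with LF line endings (binary string).
lf_body = "%PDF-1.7\n1 0 obj\n<< /Type /Catalog >>\nendobj\n2 0 obj\n<< >>\nendobj\n".b

# The same body after an LF -> CRLF conversion, as a checkout might perform.
crlf_body = lf_body.gsub("\n", "\r\n")

puts "LF offsets:   #{object_offsets(lf_body)}"
puts "CRLF offsets: #{object_offsets(crlf_body)}"
```

Each converted line ending adds one byte, so every object after the first line moves, and any offsets written into the file before the conversion no longer point at the objects they describe.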

petervwyatt · 2024-07-05T00:37:16Z

The linelint failure is against the two functional sample PDF files in lib/rouge/demos/pdf and spec/visual/samples/pdf. PDFs are not required to have an EOL on their last line (after the %%EOF), and the Rouge grammar must support this, which is why the samples are the way they are. If this is critical to fix, the EOL can be added, but there will then be no test to ensure the grammar successfully processes PDFs without the trailing EOL.
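As a small sketch of the case described here, a check that accepts %%EOF with or without a trailing EOL could look like the following. The constant `EOF_MARKER` and the method `ends_with_eof_marker?` are hypothetical and are not part of this PR or of Rouge.

```ruby
# Matches the %%EOF marker at the very end of the data, optionally
# followed by exactly one end-of-line sequence (CRLF, CR, or LF).
EOF_MARKER = /%%EOF(?:\r\n|\r|\n)?\z/

# True when the data ends with %%EOF, with or without one trailing EOL.
# The input is copied to a binary string so non-UTF-8 bytes do not raise.
def ends_with_eof_marker?(bytes)
  bytes.b.match?(EOF_MARKER)
end

puts ends_with_eof_marker?("trailer\n<< >>\n%%EOF")   # no trailing EOL
puts ends_with_eof_marker?("trailer\n<< >>\n%%EOF\n") # trailing LF
```

A spec that feeds the lexer input of the first kind would keep the no-EOL case covered even if the sample files themselves end with a newline to satisfy linelint.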

Added EOL to last line of PDF to pass linelint CI check used by Rouge. This is not required by real PDF files.

ronaldtse · 2025-09-10T07:22:13Z

Maintainers (@pyrmont @tancnle @gfx ), is it possible to help review this? This would greatly help those of us who regularly work with PDF syntaxes.

Thank you!

pyrmont · 2025-09-10T08:10:41Z

I'm sorry, @ronaldtse. I'm no longer a maintainer on this project.

ronaldtse · 2025-09-10T08:27:03Z

Apologies for unnecessarily tagging you @pyrmont , thank you for the quick response!

lib/rouge/lexers/pdf.rb

Co-authored-by: Jeanine Adkisson <jeanine.adkisson@gmail.com>

petervwyatt added 8 commits July 2, 2024 01:03

Initial PDF COS rouge lexer

02b4e29

Update pdf.rb

82785e3

Create demo PDF (functional)

e54e1d3

Needs to be treated as binary for xref to remain valid

Update pdf.rb

91d499c

Add basic spec checker

062647e

Fixups

9cf372f

Altered tokens for better color

2488909

More complex PDF for visual test

a8e8c8b

petervwyatt added 2 commits July 5, 2024 10:58

Added EOL to last line of PDF

643179c

Added EOL to last line of PDF to pass linelint CI check used by Rouge. This is not required by real PDF files.

Merge branch 'rouge-ruby:master' into feature.pdf

bf5f842

ronaldtse mentioned this pull request Sep 10, 2025

Support PDF syntax highlighting via enhanced Rouge metanorma/mn-samples-pdfa#14

Open

Merge branch 'rouge-ruby:main' into feature.pdf

3d374b0

jneen reviewed Mar 2, 2026

lib/rouge/lexers/pdf.rb (outdated)

jneen reviewed Mar 2, 2026

lib/rouge/lexers/pdf.rb (outdated)

jneen reviewed Mar 2, 2026

lib/rouge/lexers/pdf.rb (outdated)

petervwyatt and others added 5 commits March 16, 2026 15:20

Update lib/rouge/lexers/pdf.rb

7509fe3

Co-authored-by: Jeanine Adkisson <jeanine.adkisson@gmail.com>

Update lib/rouge/lexers/pdf.rb

3bd9155

Co-authored-by: Jeanine Adkisson <jeanine.adkisson@gmail.com>

Update lib/rouge/lexers/pdf.rb

4914c82

Co-authored-by: Jeanine Adkisson <jeanine.adkisson@gmail.com>

Merge branch 'rouge-ruby:main' into feature.pdf

2f803a3

Fix spelling. Ensure PERIOD in "%PDF-x.y". Comment added

43f3d71

petervwyatt wants to merge 16 commits into rouge-ruby:main from petervwyatt:feature.pdf


4 participants
