improve StringForm implementation #1619

mmatera · 2026-01-11T18:58:48Z

Going over mathics.builtin.forms, I realized that the current implementation of StringForm had some issues with poorly formed templates. The implementation now is slightly more robust and is located in mathics.eval.string.
Fixing this, also come up some issues in the messages for ContainsOnly and NIntegrate, which are also handled here.

mmatera · 2026-01-11T18:59:41Z

mathics/builtin/numbers/calculus.py

-            + "`, `".join(list(methods))
-            + "`}. Using `Automatic`"
+            + "built-in method name in {\\[RawBackquote]"
+            + "\\[RawBackquote], \\[RawBackquote]".join(list(methods))


'' here are interpreted as placeholders in StringForm`. Use the named character instead.

rocky · 2026-01-11T19:43:56Z

The implementation now is slightly more robust and is located in mathics.eval.string.
Fixing this, also come up some issues in the messages for ContainsOnly and NIntegrate, which are also handled here.

This is great. Would it be possible to add some tests that demonstrate this?

rocky · 2026-01-12T13:26:31Z

mathics/eval/strings.py

+    # are parsed.
+    # "\\`" must be parsed as "\\`" in order this
+    # works properly, but the parser converts `\\`
+    # into `\`.


Are we talking about the Mathics3 scanner or the Mathics3 parser? (Both terms are used)

Can you provide an example using mathics3-tokens that compares against CodeTokenize?

(Perhaps before release, I will be getting the Mathics3 token names to agree more)

Suppose you enter in the CLI

In[1]:="\\\`"

This is what happends in WMA:

In[1]:= A="\\\`" Out[1]= \\` In[2]:= StringTake[A,{1}] Out[2]= \ In[3]:= StringTake[A,{2}] Out[3]= \`

and this in Mathics:

In[1]:= A="\\\`" Out[1]= "\\`" In[2]:= StringTake[A,{1}] Out[2]= "\" In[3]:= StringTake[A,{2}] Out[3]= "\" In[4]:= StringTake[A,{3}] Out[4]= "`"

The point is if I try to escape the backslash before a backquote chraracter, the result is a string with a (Python) value '`', so the backslash fails to be escaped. In some way is something marginal, and can be avoided by concatenating the strings. When I have a more close description, I am going to put an issue discussing this.

Suppose you enter in the CLI

In[1]:="\\\`"

This is what happends in WMA:

In[1]:= A="\\\`" Out[1]= \\` In[2]:= StringTake[A,{1}] Out[2]= \ In[3]:= StringTake[A,{2}] Out[3]= \`

and this in Mathics:

In[1]:= A="\\\`" Out[1]= "\\`" In[2]:= StringTake[A,{1}] Out[2]= "\" In[3]:= StringTake[A,{2}] Out[3]= "\" In[4]:= StringTake[A,{3}] Out[4]= "`"

The point is if I try to escape the backslash before a backquote chraracter, the result is a string with a (Python) value '`', so the backslash fails to be escaped. In some way is something marginal, and can be avoided by concatenating the strings. When I have a more close description, I am going to put an issue discussing this.

Ok. Thanks for the information. So the problem might not necessarily be in Mathics3's scanner. It might not even be in Mathics3's parser, either, but somewhere else, such as in boxing routines.

And when we know what's up, then we can decide whether to fix it or write workaround code as we have here. (Or maybe this is where the code should go.)

Let's mark this as a draft until the tests and a decision about how to handle this kind of thing are discussed.

BTW, in the back of my mind, when I mentioned going, moving from resolving InputForm to OutputForm, I had thought: there are probably a couple of smaller, simpler data forms that we could and probably should handle before OutputForm and which will probably feed into OutputForm.

I hesitated on that only because I sensed an anxiety to get OutputForm overhauled (along with the other general Forms like StandardForm). Rest assured, they will get done.

But there will be a lot less chaos if we start small and work our way up.

Ok. Thanks for the information. So the problem might not necessarily be in Mathics3's scanner. It might not even be in Mathics3's parser, either, but somewhere else, such as in boxing routines.

And when we know what's up, then we can decide whether to fix it or write workaround code as we have here. (Or maybe this is where the code should go.)

Let's mark this as a draft until the tests and a decision about how to handle this kind of thing are discussed.

I think that fixing this issue with the escape sequences does not affect this PR: In the end, an expression of the form StringForm["... \\` ...", ...] would be a corner case. Right now, what is needed for the existing and foreseeable code is the ability to escape the backquote, which this PR handles. Tests for this were already added.

BTW, in the back of my mind, when I mentioned going, moving from resolving InputForm to OutputForm, I had thought: there are probably a couple of smaller, simpler data forms that we could and probably should handle before OutputForm and which will probably feed into OutputForm.

My reason to go over OutputForm is that it decouples the MakeBoxes processing from most of the other tests. I am close to cover all the cases to switch format_element to use it instead of the current format sequence.

I hesitated on that only because I sensed an anxiety to get OutputForm overhauled (along with the other general Forms like StandardForm). Rest assured, they will get done.

But this is happening along with the work with OutputForm: issues with these other special forms are coming up and handled as I check the new OutputForm routine against the existing code. In comparison, this PR is quite simple. When this PR gets merged, the next step would be to rework NumberForm, which seems to be a little more involved.

But there will be a lot less chaos if we start small and work our way up.

* avoid '<mo></mo>' in MathMLForm (empty string operator) * Fix error handing in eval_StringForm_MakeBoxes * Improve StringForm documentation

"<mo></mo>" cannot be parsed in Mathics-Django browser interface, so avoid to convert `""` into `<mo></mo>`.

* Documentation tested on Mathics-Django and LaTeX * ruff

mmatera · 2026-01-12T17:22:23Z

@rocky, I guess now this is ready for review. Please check the docstring.

rocky · 2026-01-12T21:41:04Z

@rocky, I guess now this is ready for review. Please check the docstring.

Great! I'll look at Tuesday morning.

improve StringForm implementation

3669274

mmatera commented Jan 11, 2026

View reviewed changes

mmatera added 2 commits January 12, 2026 09:35

adding tests. Handlind escaped backquotes.

1f0c833

Merge remote-tracking branch 'origin/master' into StringForm

2644dd1

mmatera force-pushed the StringForm branch from 163136c to 2644dd1 Compare January 12, 2026 12:41

another trailing dot

7d635f9

rocky reviewed Jan 12, 2026

View reviewed changes

mmatera added 6 commits January 12, 2026 13:27

* Fix typo in documentation

9e42445

* avoid '<mo></mo>' in MathMLForm (empty string operator) * Fix error handing in eval_StringForm_MakeBoxes * Improve StringForm documentation

Fix MathMLForm for empty operators

536dd4c

"<mo></mo>" cannot be parsed in Mathics-Django browser interface, so avoid to convert `""` into `<mo></mo>`.

merge

1a4f1d6

Is MakeBoxes! return the template string instead

924096b

and adjust back the doctest

3d10835

* Fully working.

8beb109

* Documentation tested on Mathics-Django and LaTeX * ruff

mmatera mentioned this pull request Jan 12, 2026

Escape sequences in string parsing #1622

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

improve StringForm implementation #1619

improve StringForm implementation #1619

Uh oh!

mmatera commented Jan 11, 2026

Uh oh!

mmatera Jan 11, 2026

Uh oh!

rocky commented Jan 11, 2026

Uh oh!

rocky Jan 12, 2026 •

edited

Loading

Uh oh!

mmatera Jan 12, 2026

Uh oh!

rocky Jan 12, 2026 •

edited

Loading

Uh oh!

mmatera Jan 12, 2026

Uh oh!

mmatera commented Jan 12, 2026

Uh oh!

rocky commented Jan 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

improve StringForm implementation #1619

Are you sure you want to change the base?

improve StringForm implementation #1619

Uh oh!

Conversation

mmatera commented Jan 11, 2026

Uh oh!

mmatera Jan 11, 2026

Choose a reason for hiding this comment

Uh oh!

rocky commented Jan 11, 2026

Uh oh!

rocky Jan 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mmatera Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

rocky Jan 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mmatera Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

mmatera commented Jan 12, 2026

Uh oh!

rocky commented Jan 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

rocky Jan 12, 2026 •

edited

Loading

rocky Jan 12, 2026 •

edited

Loading