
Conversation


bufdev commented Nov 24, 2025

This would effectively both deprecate usage of google.type.Decimal and actually enforce valid values with Protovalidate.

bufdev changed the title from "Add decimal to StringRules" to "WIP: Add decimal to StringRules" on Nov 24, 2025

github-actions bot commented Nov 24, 2025

The latest Buf updates on your PR. Results from workflow Buf CI / buf (pull_request).

Build: ✅ passed
Format: ✅ passed
Lint: ✅ passed
Breaking: ✅ passed
Updated (UTC): Nov 26, 2025, 7:59 PM


hudlow commented Nov 24, 2025

@bufdev Is it safe to assume that the goal is to align these rules to what could be represented in a SQL DECIMAL/NUMERIC column?

If so (and based on the desire to possibly extend to the full range of google.type.Decimal representations), I would expect that these validations would need to happen on a normalized representation such that 00001.10000, 0.0011e3, 1100e-3 would all validate as precision = 2, scale = 1 — is that right?


bufdev commented Nov 24, 2025

> Is it safe to assume that the goal is to align these rules to what could be represented in a SQL DECIMAL/NUMERIC column?

Yes

> If so (and based on the desire to possibly extend to the full range of google.type.Decimal representations), I would expect that these validations would need to happen on a normalized representation such that 00001.10000, 0.0011e3, 1100e-3 would all validate as precision = 2, scale = 1 — is that right?

Probably not? I don't know; what do SQL databases generally accept? 00001.10000 is an interesting one; I would say this is just invalid - leading zeroes feel like an error. 0.10000 means something different than 0.1 in math, so I'd say the former has more precision. I would want to know what is generally accepted.


hudlow commented Nov 25, 2025

From what I can tell, normalization prior to conversion is typical. Unfortunately, I don't have access to the relevant sections of ISO 9075, but this is runnable against all the engines on sqlfiddle.com and they all normalize the values without losing any magnitudinal information:

CREATE TABLE numbers (
  num NUMERIC(2, 1)
);

INSERT INTO numbers(num) VALUES (1.1);
INSERT INTO numbers(num) VALUES (0001.1000);
INSERT INTO numbers(num) VALUES (0.0011e3);
INSERT INTO numbers(num) VALUES (1100e-3);

SELECT * FROM numbers;
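
-- Expected result, per the note above: all four inserts are accepted and each
-- row comes back normalized as 1.1.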

> 00001.10000 is an interesting one; I would say this is just invalid - leading zeroes feel like an error. 0.10000 means something different than 0.1 in math

Yeah, it's strange. google.type.Decimal isn't really specified, and is of no help here in terms of precedent.

Of course, one or more leading zeroes is a common signifier for octal; for example, ECMAScript and Go both interpret 077 (or 0077, etc) as 63. In ECMAScript, 077.0 is a syntax error; in Go, somewhat appallingly, it's 77.0. Go comes by this honestly though—C behaves the same way. Rust ignores leading zeroes in integer and float literals.

The inconsistencies here are an argument for rejecting numbers with leading zeroes on the grounds that someone might expect you to interpret them as octal and someone else might not.

My own pet peeve in the numeric literal space is the lack of overall limits on digit count. I think any spec that says if I give you an unbounded number of zeroes before or after the decimal place you have to parse it—or you have to select your own limit—is saliently deficient.

As far as treating trailing zeroes after the decimal point as indicators of precision, as is done for empirical quantities in science... it certainly bothers me when software calls things significant digits that aren't, but I can't recall ever encountering a programming language that actually treats numeric literals according to the rules of significant digits (where trailing zeroes after the decimal point are significant and trailing zeroes in a number without a decimal point are insignificant).

One reason for this is probably that you can't truncate digits at each step in a mathematical operation, so a programming language that followed the rules of significant digits would have to track the precision of numeric values in parallel to their magnitude.

In a greenfield context, I wouldn't be opposed to validating the significant digits of numeric literals against a predefined precision, but I think the precedent in the SQL-adjacent space is that decimal literals are about whether a value's magnitude can be expressed, not about inferring anything about its precision from the literal representation.


bufdev commented Nov 25, 2025

Overly restrictive is better than too loose to start if there's a choice - we can always loosen. Time to market is most important.


hudlow commented Nov 25, 2025

> Overly restrictive is better than too loose to start if there's a choice - we can always loosen.

I'd say any change in strictness for a validation rule is a breaking change.


bufdev commented Nov 26, 2025

Loosening is fine.


hudlow commented Nov 26, 2025

I took a swing at an implementation, but I'm hoping someone (@timostamm?) can help fill in some gaps for me:

  1. I'm a little fuzzy on the testing regime.
  2. I'm not at all sure I put the right rules in the right places for the meta-validations of rule values. I couldn't really find much precedent for doing this. @bufdev had employed a pattern that appeared to keep the meta-validation separate, but I couldn't get it to work in practice or correlate it to other precedent.

Comment on lines 3773 to 3787
(predefined).cel = {
id: "string.decimal.precision_minimum"
message: "precision must be at least 1"
expression: "!has(rules.decimal.precision) || rules.decimal.precision > 0"
},
(predefined).cel = {
id: "string.decimal.scale_without_precision"
message: "if scale is set, precision must also be set"
expression: "!has(rules.decimal.scale) || has(rules.decimal.precision)"
},
(predefined).cel = {
id: "string.decimal.scale_less_than_precision"
message: "scale must be less than precision"
expression: "!has(rules.decimal.precision) || !has(rules.decimal.scale) || rules.decimal.scale < rules.decimal.precision"
},

@timostamm Do we have precedent for validations that do not use this and consequently could be checked at compile time?


When this, rule, and rules are declared, implementations set the appropriate CEL type. See https://github.com/bufbuild/protovalidate-go/blob/v1.0.1/cache.go#L160-L162. When the implementation compiles the expression, it can raise type errors.

timostamm commented:

> testing

This repository defines the validation rules and provides a test suite for implementations.
It does not test that validate.proto is structurally sound, or that its CEL expressions compile.

> meta-validations of rule values

As in "if scale is set, precision must also be set"? The rule (buf.validate.message).oneof is a bit similar. It lets the user specify field names. If an unknown field name is specified, implementations must raise a compile error. This validation isn't specified in CEL, it's up to the implementation. Conformance tests.

// // point, and at most four digits before the decimal point.
// // "1000", "100.1", and "100" are valid, whereas "10000" and "1.111" are not.
// string value = 1 [
// (buf.validate.field).decimal.precision = 6,

(buf.validate.field).decimal is not a field.

(buf.validate.field).string is field StringRules string = 14.
(buf.validate.field).string.len is field uint64 len = 19.
(buf.validate.field).string.decimal is field DecimalRules decimal = 36.


hudlow commented Nov 26, 2025

Update:

  • Per @timostamm's advice, I've punted on meta-validation for now.
  • I added conformance tests.
  • (buf.validate.field).string.decimal = {} validates a decimal number is well formed:
    • valid: 0, 12345, 12345.00000, 12345.12345
    • invalid: 00, 01, 01.2, 0x0, 1e2, 1., .1, 1.2.3
  • (buf.validate.field).string.decimal.precision = 4 validates a number has <= 4 digits:
    • valid: 1234, 1.234, 0.000
    • invalid: 12345, 1.2345, 1234.0, 0.0000
  • (buf.validate.field).string.decimal.scale = 2 validates a number has <= 2 digits after the decimal point:
    • valid: 0, 0.1, 12345.67, 0.00
    • invalid: 1.000, 12345.678
  • (buf.validate.field).string.decimal = { precision: 6, scale: 2 } additionally validates that a number has at most (6 - 2 =) 4 digits before the decimal point. This is consistent with SQL limitations for decimal numbers with a defined precision and scale (see the usage sketch after this list):
    • valid: 1234, 0.00, 1234.56
    • invalid: 12345, 0.000
  • Ran the new conformance tests in protovalidate-es and all seems well.
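
Putting those together, a usage sketch (hypothetical message and field names; the rule path is the one @timostamm pointed out above):

syntax = "proto3";

import "buf/validate/validate.proto";

message PriceUpdate {
  // Hypothetical field. With precision = 6 and scale = 2, the value must be a
  // well-formed decimal string with at most 6 digits in total, at most 2 of
  // them after the decimal point, and therefore at most 4 before it.
  // "1234.56", "0.00", and "1234" pass; "12345", "0.000", and "1.2.3" fail.
  string amount = 1 [(buf.validate.field).string.decimal = {
    precision: 6
    scale: 2
  }];
}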

@bufdev out of band, I think you expressed some doubt about using a string format for this. Some thoughts:

  • There is a structural decimal format as a part of the AEP project offering an alternative vision of how you could do this, but it's severely limited in precision because of the choice to use an int64 as the significand — it doesn't seem like a good fit for parity with SQL (where some databases support a maximum precision of 38, and others support precision in the thousands or tens of thousands).
  • You could make a case for a structural decimal number consisting of a pure-digit-string significand and an int32 exponent (roughly sketched after this list), but I think most people would find this much more annoying to consume.
  • There is also an argument for requiring trailing zeroes to indicate the scale, which would have the nice effect of requiring numbers of a given scale to be fully normalized, such that equality of strings would indicate equality of magnitude and scale (albeit not necessarily precision).
    • Or you could require no trailing zeroes which would fully normalize magnitude.
    • In the end, it doesn't seem worth it to me to pursue either, because I think some people will find either annoying, and the benefit is minimal.
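
For illustration only, the structural alternative above might look roughly like this (hypothetical; not proposed here and not part of protovalidate):

syntax = "proto3";

// Hypothetical structural decimal: the value is significand * 10^exponent,
// e.g. significand = "12345" with exponent = -2 represents 123.45.
message StructuralDecimal {
  // Digits only, per the "pure-digit-string" idea above (sign handling left
  // aside), so precision isn't capped by an int64 significand the way the
  // AEP format mentioned earlier is.
  string significand = 1;

  // Power-of-ten exponent applied to the significand.
  int32 exponent = 2;
}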

hudlow requested a review from timostamm on November 26, 2025 at 22:31
Comment on lines +3928 to +3929
//
// TODO: Extend to the possible representations that google.type.Decimal allows?

I'm not certain we'll ever want to do this, but if we do, I think we'll want to move parsing to the libraries, which would increase the scope a lot. I suspect we won't want to do that now in any case.


bufdev commented Dec 1, 2025

Do not merge for now

hudlow marked this pull request as draft on December 1, 2025 at 15:51

hudlow commented Dec 1, 2025

Converting to draft per @bufdev's comment.
