DOKK / manpages / debian 12 / libtypes-xsd-perl / Types::XSD.3pm.en
Types::XSD(3pm) User Contributed Perl Documentation Types::XSD(3pm)

Types::XSD - type constraints based on XML schema datatypes

   package Person;
   
   use Moo;
   use Types::XSD qw( PositiveInteger String );
   
   has name => (is => "ro", isa => String[ minLength => 1 ]);
   has age  => (is => "ro", isa => PositiveInteger);

Types::XSD is a type constraint library inspired by XML Schema, and built with Type::Library. It can be used as a type constraint library for Moo, Mouse or Moose, or used completely independently of any OO framework.

This module is an extension of Types::XSD::Lite (which has fewer type constraints, but fewer dependencies). For completeness, the type constraints and other features inherited from Types::XSD::Lite are documented below too.

This module defines the following type constraints based on the data types defined in XML Schema <http://www.w3.org/TR/xmlschema-2/>. (The names of the type constraints are the same as the XML Schema data types, but capitalization often differs.)

I've added some quick explainations of what each type is, but for details, see the XML Schema specification.

"AnyType"
As per "Any" from Types::Standard.
"AnySimpleType"
As per "Value" from Types::Standard.
"String"
As per "Str" from Types::Standard.
"NormalizedString"
A string containing no line breaks, carriage returns or tabs.
"Token"
Like "NormalizedString", but also no leading or trailing space, and no doubled spaces (i.e. not "/\s{2,}/").
"Language"
An RFC 3066 language code.
"Name"
Something that could be a valid XML element or attribute name. These roughly correspond to Perl identifiers but may also contain colons, hyphens and stops. (Digits, hyphens and stops are not allowed as the first character.)
"NmToken"
Slightly looser version of "Name"; allows digits, hyphens and stops in the first character.
"NmTokens"
Space-separated list of "NmToken".
"NCName"
Slightly tighter vesion of "Name"; disallows colons.
"Id"
Effectively the same as "NCName".
"IdRef"
Effectively the same as "NCName".
"IdRefs"
Space-separated list of "IdRef".
"Entity"
Effectively the same as "NCName".
"Entities"
Space-separated list of "Entity".
"Boolean"
Allows "true", "false", "1" and "0" (case-insensitively).

Gotcha: The string "false" evaluates to true in Perl. You probably want to use "Bool" from Types::Standard instead.

"Base64Binary"
Strings which are valid Base64 data. Allows whitespace.

Gotcha: If you parameterize this with "length", "maxLength" or "minLength", it is the length of the decoded string which will be checked.

"HexBinary"
Strings which are valid hexadecimal data. Disallows whitespace; disallows leading "0x".

Gotcha: If you parameterize this with "length", "maxLength" or "minLength", it is the length of the decoded string which will be checked.

"Float"
As per "Num" from Types::Standard.
"Double"
As per "Num" from Types::Standard.
"AnyURI"
Any absolute or relative URI. Effectively, any string at all!
"QName"
An XML QName; something that could be used as a valid element name in a namespaced XML document.

Gotcha: while "length", "maxLength" and "minLength" are allowed facets for parameterization, they are silently ignored, as per the specification!

"Notation"
Effectively the same as "QName". According to XML Schema, this is always supposed to be parameterized with an enumeration. But we don't enforce that.

Gotcha: while "length", "maxLength" and "minLength" are allowed facets for parameterization, they are silently ignored, as per the specification!

"Decimal"
Numbers possibly including a decimal point, but not allowing exponential notation (e.g. "3.14e-3").
"Integer"
As per "Int" from Types::Standard.
"NonPositiveInteger"
An "Integer" 0 or below.
"NegativeInteger"
An "Integer" -1 or below.
"Long"
An "Integer" between -9223372036854775808 and 9223372036854775807 (inclusive).
"Int"
An "Integer" between -2147483648 and 2147483647 (inclusive).
"Short"
An "Integer" between -32768 and 32767 (inclusive).
"Byte"
An "Integer" between -128 and 127 (inclusive).
"NonNegativeInteger"
An "Integer" 0 or above.
"PositiveInteger"
An "Integer" 1 or above.
"UnsignedLong"
A "NonNegativeInteger" between 0 and 18446744073709551615 (inclusive).
"UnsignedInt"
A "NonNegativeInteger" between 0 and 4294967295 (inclusive).
"UnsignedShort"
A "NonNegativeInteger" between 0 and 65535 (inclusive).
"UnsignedByte"
A "NonNegativeInteger" between 0 and 255 (inclusive).
"Duration"
An ISO 8601 duration.
"YearMonthDuration"
An ISO 8601 duration restricted to cover only years and months.
"DayTimeDuration"
An ISO 8601 duration restricted to cover only days, hours, minutes and seconds. (Note that this still permits durations of many years, as the days component is an arbitrary non-negative integer.)
"DateTime"
An ISO 8601 datetime with optional timezone.
"DateTimeStamp"
An ISO 8601 datetime with required timezone.
"Time"
An ISO 8601 time with optional timezone.
"Date"
An ISO 8601 date with optional timezone.
"GYearMonth"
An year-month pair with optional timezone.
"GYear"
An year with optional timezone.
"GMonthDay"
An month-day pair with optional timezone.
"GDay"
An day with optional timezone.
"GMonth"
An month with optional timezone.

Datatypes can be parameterized using the facets defined by XML Schema. For example:

   use Types::XSD qw( String Decimal PositiveInteger Token );
   
   my @sizes = qw( XS S M L XL XXL );
   
   has name   => (is => "ro", isa => String[ minLength => 1 ]);
   has price  => (is => "ro", isa => Decimal[ fractionDigits => 2 ]);
   has rating => (is => "ro", isa => PositiveInteger[ maxInclusive => 5 ]);
   has size   => (is => "ro", isa => Token[ enumeration => \@sizes ]);

The following facets exist, but not all facets are supported for all datatypes. (The module will croak if you try to use an unsupported facet.)

"enumeration"
An arrayref of allowable values. You should probably use Type::Tiny::Enum instead.
"pattern"
A regular expression that the value is expected to conform to. Use a normal Perl quoted regexp:

   Token[ pattern => qr{^[a-z]+$} ]
    
"whiteSpace"
The "whiteSpace" facet is ignored as I'm not entirely sure what it should do. It perhaps makes sense for coercions, but this module doesn't define any coercions.
"assertions"
An arrayref of arbitrary additional restrictions, expressed as strings of Perl code or coderefs operating on $_.

For example:

   Integer[
      assertions => [
         '$_ % 3 == 0',            # multiple of three, and...
         sub { is_nice($_) },      # is nice (whatever that means)
      ],
   ],
    

Strings of Perl code will result in faster-running type constraints.

"length", "maxLength", "minLength"
Restrict the length of a value. For example "Integer[length=>2]" allows 10, 99 and "-1", but not 100, 9 or "-10".

Types::XSD won't prevent you from making ridiculous constraints such as "String[ maxLength => 1, minLength => 2 ]".

Note that on "HexBinary" and "Base64Binary" types, the lengths apply to the decoded string. Length restrictions are silently ignored for "QName" and "Notation" because the W3C doesn't think you should care what length these datatypes are.

"maxInclusive", "minInclusive", "maxExclusive", "minExclusive"
Supported for numeric types and datetime/duration-related types.

Note that to be super-correct, the "{max,min}{Inclusive,Exclusive}" facets for numeric types are performed by passing the numbers through Math::BigInt or Math::BigFloat, so may be a little slow.

"totalDigits"
For a decimal (or type derived from decimals) specifies that the total number of digits for the value must be at most this number. Given "Decimal[ totalDigits => 3 ]", 1.23, 12.3, 123, 1.2 and 1 are all allowable; 1.234 is not. 1.230 is also not, but this may change in a future version.
"fractionDigits"
Like "totalDigits" but ignores digits before the decimal point.
"explicitTimezone"
May be "optional", "prohibited" or "required". For example:

   Time[ explicitTimezone => "prohibited" ]
    

This module also exports some convenience functions:

"dur_parse($str)"
Parse an xsd:duration string, returning a DateTime::Duration.
"dur_cmp($a, $b)"
Compare two strings conforming to the xsd:duration datatype to indicate which is the longer duration.

Returns -1 if $a is shorter. Returns 1 if $b is shorter. Returns 0 if the durations are identical. Returns undef if the comparison is indeterminate; for example, "P1Y" (one year) and "P365D" (365 days) are not necessarily identical - in leap years "P365D" is shorter.

"dt_cmp($type, $a, $b)"
Compare two datetime-like strings. For example, two "gYearMonth" strings can be compared using:

   dt_cmp(GYearMonth, "2009-02", "2010-10");
    

Both strings are expected to conform to the same datatype. It doesn't make much sense to compare them otherwise.

"dt_parse($type, $str)"
Parse a datetime-like string, returning a DateTime::Incomplete object. Note that DateTime::Incomplete objects are always returned, even if the datetime is potentially complete.

Please report any bugs to <http://rt.cpan.org/Dist/Display.html?Queue=Types-XSD>.

Type::Tiny, Types::XSD::Lite, Types::Standard.

  • <http://www.w3.org/TR/xmlschema-2/> Datatypes in XML Schema 1.0
  • <http://www.w3.org/TR/xmlschema11-2/> Datatypes in XML Schema 1.1

Toby Inkster <tobyink@cpan.org>.

This software is copyright (c) 2013-2014, 2021 by Toby Inkster.

This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.

THIS PACKAGE IS PROVIDED "AS IS" AND WITHOUT ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, WITHOUT LIMITATION, THE IMPLIED WARRANTIES OF MERCHANTIBILITY AND FITNESS FOR A PARTICULAR PURPOSE.

2022-09-28 perl v5.34.0