Introducing the Domains of the poly module¶

This page introduces the idea of the “domains” that are used in SymPy’s sympy.polys module. The emphasis is on introducing how to use the domains directly and on understanding how they are used internally as part of the Poly class. This is a relatively advanced topic so for a more introductory understanding of the Poly class and the sympy.polys module it is recommended to read Basic functionality of the module instead. The reference documentation for the domain classes is in Reference docs for the Poly Domains. Internal functions that make use of the domains are documented in Internals of the Polynomial Manipulation Module.

What are the domains?¶

For most users the domains are only really noticeable in the printed output of a Poly:

>>> from sympy import Symbol, Poly
>>> x = Symbol('x')
>>> Poly(x**2 + x)
Poly(x**2 + x, x, domain='ZZ')
>>> Poly(x**2 + x/2)
Poly(x**2 + 1/2*x, x, domain='QQ')

We see here that one Poly has domain ZZ representing the integers and the other has domain QQ representing the rationals. These indicate the “domain” from which the coefficients of the polynomial are drawn.

From a high-level the domains represent formal concepts such as the set of integers \(\mathbb{Z}\) or rationals \(\mathbb{Q}\). The word “domain” here is a reference to the mathematical concept of an integral domain.

Internally the domains correspond to different computational implementations and representations of the expressions that the polynomials correspond to. The Poly object itself has an internal representation as a list of coefficients and a domain attribute representing the implementation of those coefficients:

>>> p = Poly(x**2 + x/2)
>>> p
Poly(x**2 + 1/2*x, x, domain='QQ')
>>> p.domain
QQ
>>> p.rep
DMP([1, 1/2, 0], QQ, None)
>>> p.rep.rep
[1, 1/2, 0]
>>> type(p.rep.rep[0])  
<class 'sympy.external.pythonmpq.PythonMPQ'>

Here the domain is QQ which represents the implementation of the rational numbers in the domain system. The Poly instance itself has a Poly.domain attribute QQ and then a list of PythonMPQ coefficients where PythonMPQ is the class that implements the elements of the QQ domain. The list of coefficients [1, 1/2, 0] gives a standardised low-level representation of the polynomial expression (1)*x**2 + (1/2)*x + (0).

This page looks at the different domains that are defined in SymPy, how they are implemented and how they can be used. It introduces how to use the domains and domain elements directly and explains how they are used internally as part of Poly objects. This information is more relevant for development in SymPy than it is for users of the sympy.polys module.

Representing expressions symbolically¶

There are many different ways that a mathematical expression can be represented symbolically. The purpose of the polynomial domains is to provide suitable implementations for different classes of expressions. This section considers the basic approaches to the symbolic representation of mathematical expressions: “tree”, “dense polynomial” and “sparse polynomial”.

Tree representation¶

The most general representation of symbolic expressions is as a tree and this is the representation used for most ordinary SymPy expressions which are instances of Expr (a subclass of Basic). We can see this representation using the srepr() function:

>>> from sympy import Symbol, srepr
>>> x = Symbol('x')
>>> e = 1 + 1/(2 + x**2)
>>> e
1 + 1/(x**2 + 2)
>>> print(srepr(e))
Add(Integer(1), Pow(Add(Pow(Symbol('x'), Integer(2)), Integer(2)), Integer(-1)))

Here the expression e is represented as an Add node which has two children 1 and 1/(x**2 + 2). The child 1 is represented as an Integer and the other child is represented as a Pow with base x**2 + 2 and exponent 1. Then x**2 + 2 is represented as an Add with children x**2 and 2 and so on. In this way the expression is represented as a tree where the internal nodes are operations like Add, Mul, Pow and so on and the leaf nodes are atomic expression types like Integer and Symbol. See Advanced Expression Manipulation for more about this representation.

The tree representation is core to the architecture of Expr in SymPy. It is a highly flexible representation that can represent a very wide range of possible expressions. It can also represent equivalent expressions in different ways e.g.:

>>> e = x*(x + 1)
>>> e
x*(x + 1)
>>> e.expand()
x**2 + x

These two expression although equivalent have different tree representations:

>>> print(srepr(e))
Mul(Symbol('x'), Add(Symbol('x'), Integer(1)))
>>> print(srepr(e.expand()))
Add(Pow(Symbol('x'), Integer(2)), Symbol('x'))

Being able to represent the same expression in different ways is both a strength and a weakness. It is useful to be able to convert an expression in to different forms for different tasks but having non-unique representations makes it hard to tell when two expressions are equivalent which is in fact very important for many computational algorithms. The most important task is being able to tell when an expression is equal to zero which is undecidable in general (Richardon’s theorem) but is decidable in many important special cases.

DUP representation¶

Restricting the set of allowed expressions to special cases allows for much more efficient symbolic representations. As we already saw Poly can represent a polynomial as a list of coefficients. This means that an expression like x**4 + x + 1 could be represented simply as [1, 0, 0, 1, 1]. This list of coefficients representation of a polynomial expression is known as the “dense univariate polynomial” (DUP) representation. Working within that representation algorithms for multiplication, addition and crucially zero-testing can be much more efficient than with the corresponding tree representations. We can see this representation from a Poly instance by looking it its rep.rep attribute:

>>> p = Poly(x**4 + x + 1)
>>> p.rep.rep
[1, 0, 0, 1, 1]

In the DUP representation it is not possible to represent the same expression in different ways. There is no distinction between x*(x + 1) and x**2 + x because both are just [1, 1, 0]. This means that comparing two expressions is easy: they are equal if and only if all of their coefficients are equal. Zero-testing is particularly easy: the polynomial is zero if and only if all coefficients are zero (of course we need to have easy zero-testing for the coefficients themselves).

We can make functions that operate on the DUP representation much more efficiently than functions that operate on the tree representation. Many operations with standard sympy expressions are in fact computed by converting to a polynomial representation and then performing the calculation. An example is the factor() function:

>>> from sympy import factor
>>> e = 2*x**3 + 10*x**2 + 16*x + 8
>>> e
2*x**3 + 10*x**2 + 16*x + 8
>>> factor(e)
2*(x + 1)*(x + 2)**2

Internally factor() will convert the expression from the tree representation into the DUP representation and then use the function dup_factor_list:

>>> from sympy import ZZ
>>> from sympy.polys.factortools import dup_factor_list
>>> p = [ZZ(2), ZZ(10), ZZ(16), ZZ(8)]
>>> p
[2, 10, 16, 8]
>>> dup_factor_list(p, ZZ)
(2, [([1, 1], 1), ([1, 2], 2)])

There are many more examples of functions with dup_* names for operating on the DUP representation that are documented in Internals of the Polynomial Manipulation Module. There are also functions with the dmp_* prefix for operating on multivariate polynomials.

DMP representation¶

A multivariate polynomial (a polynomial in multiple variables) can be represented as a polynomial with coefficients that are themselves polynomials. For example x**2*y + x**2 + x*y + y + 1 can be represented as polynomial in x where the coefficients are themselves polynomials in y i.e.: (y + 1)*x**2 + (y)*x + (y+1). Since we can represent a polynomial with a list of coefficients a multivariate polynomial can be represented with a list of lists of coefficients:

>>> from sympy import symbols
>>> x, y = symbols('x, y')
>>> p = Poly(x**2*y + x**2 + x*y + y + 1)
>>> p
Poly(x**2*y + x**2 + x*y + y + 1, x, y, domain='ZZ')
>>> p.rep.rep
[[1, 1], [1, 0], [1, 1]]

This list of lists of (lists of…) coefficients representation is known as the “dense multivariate polynomial” (DMP) representation.

Sparse polynomial representation¶

Instead of lists we can use a dict mapping nonzero monomial terms to their coefficients. This is known as the “sparse polynomial” representation. We can see what this would look like using the as_dict() method:

>>> Poly(7*x**20 + 8*x + 9).rep.rep
[7, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 8, 9]
>>> Poly(7*x**20 + 8*x + 9).as_dict()
{(0,): 9, (1,): 8, (20,): 7}

The keys of this dict are the exponents of the powers of x and the values are the coefficients so e.g. 7*x**20 becomes (20,): 7 in the dict. The key is a tuple so that in the multivariate case something like 4*x**2*y**3 can be represented as (2, 3): 4. The sparse representation can be more efficient as it avoids the need to store and manipulate the zero coefficients. With a large number of generators (variables) the dense representation becomes particularly inefficient and it is better to use the sparse representation:

>>> from sympy import prod
>>> gens = symbols('x:10')
>>> gens
(x0, x1, x2, x3, x4, x5, x6, x7, x8, x9)
>>> p = Poly(prod(gens))
>>> p
Poly(x0*x1*x2*x3*x4*x5*x6*x7*x8*x9, x0, x1, x2, x3, x4, x5, x6, x7, x8, x9, domain='ZZ')
>>> p.rep.rep
[[[[[[[[[[1, 0], []], [[]]], [[[]]]], [[[[]]]]], [[[[[]]]]]], [[[[[[]]]]]]], [[[[[[[]]]]]]]], [[[[[[[[]]]]]]]]], [[[[[[[[[]]]]]]]]]]
>>> p.as_dict()
{(1, 1, 1, 1, 1, 1, 1, 1, 1, 1): 1}

The dict representation shown in the last output maps from the monomial which is represented as a tuple of powers ((1, 1, 1, ...) i.e. x0**1 * x1**1, ...) to the coefficient 1. Compared to the DMP representation we have a much more flattened data structure: it is a dict with only one key and value. Algorithms for working with sparse representations would likely be much more efficient than dense algorithms for this particular example polynomial.

SymPy’s polynomial module has implementations of polynomial expressions based on both the dense and sparse representations. There are also other implementations of different special classes of expressions that can be used as the coefficients of those polynomials. The rest of this page discusses what those representations are and how to use them.

Basic usage of domains¶

Several domains are predefined and ready to be used such as ZZ and QQ which represent the ring of integers \(\mathbb{Z}\) and the field of rationals \(\mathbb{Q}\). The Domain object is used to construct elements which can then be used in ordinary arithmetic operations.:

>>> from sympy import ZZ
>>> z1 = ZZ(2)
>>> z1
2
>>> z1 + z1
4
>>> type(z1)  
<class 'int'>
>>> z1 in ZZ
True

The basic operations +, -, and * for addition, subtraction and multiplication will work for the elements of any domain and will produce new domain elements. Division with / (Python’s “true division” operator) is not possible for all domains and should not be used with domain elements unless the domain is known to be a field. For example dividing two elements of ZZ gives a float which is not an element of ZZ:

>>> z1 / z1
1.0
>>> type(z1 / z1)  
<class 'float'>
>>> ZZ.is_Field
False

Most domains representing non-field rings allow floor and modulo division (remainder) with Python’s floor division // and modulo division % operators. For example with ZZ:

>>> z1 // z1
1
>>> z1 % z1
0

The QQ domain represents the field of rational numbers and does allow division:

>>> from sympy import QQ
>>> q1 = QQ(1, 2)
>>> q1
1/2
>>> q2 = QQ(2, 3)
>>> q2
2/3
>>> q1 / q2
3/4
>>> type(q1)  
<class 'sympy.external.pythonmpq.PythonMPQ'>

In general code that is expected to work with elements of an arbitrary domain should not use the division operators /, // and %. Only the operators +, -, * and ** (with nonnegative integer exponent) should be assumed to work with arbitrary domain elements. All other operations should be accessed as functions from the Domain object:

>>> ZZ.quo(ZZ(5), ZZ(3))  # 5 // 3
1
>>> ZZ.rem(ZZ(5), ZZ(3))  # 5 % 3
2
>>> ZZ.div(ZZ(5), ZZ(3))  # divmod(5, 3)
(1, 2)
>>> QQ.div(QQ(5), QQ(3))
(5/3, 0)

The exquo() function is used to compute an exact quotient. This is the analogue of a / b but where the division is expected to be exact (with no remainder) or an error will be raised:

>>> QQ.exquo(QQ(5), QQ(3))
5/3
>>> ZZ.exquo(ZZ(4), ZZ(2))
2
>>> ZZ.exquo(ZZ(5), ZZ(3))
Traceback (most recent call last):
...
ExactQuotientFailed: 3 does not divide 5 in ZZ

The exact methods and attributes of the domain elements are not guaranteed in general beyond the basic arithmetic operations. It should not be presumed that e.g. ZZ will always be of type int. If gmpy or gmpy2 is installed then the mpz or mpq types are used instead for ZZ and QQ:

>>> from sympy import ZZ, QQ
>>> ZZ(2)  
mpz(2)
>>> QQ(2, 3)  
mpq(2, 3)

The mpz type is faster than Python’s standard int type for operations with large integers although for smaller integers the difference is not so significant. The mpq type representing rational numbers is implemented in C rather than Python and is many times faster than the pure Python implementation of QQ that is used when gmpy is not installed.

In general the Python type used for the elements of a domain can be checked from the dtype attribute of the domain. When gmpy is installed the dtype for ZZ is \(mpz\) which is not an actual type and can not be used with \(isinstance\). For this reason the of_type() method can be used to check if an object is an element of dtype.:

>>> z = ZZ(2)
>>> type(z)  
<class 'int'>
>>> ZZ.dtype  
<class 'int'>
>>> ZZ.of_type(z)
True

Domain elements vs sympy expressions¶

Note that domain elements are not of the same type as ordinary sympy expressions which are subclasses of Expr such as Integer. Ordinary sympy expressions are created with the sympify() function.:

>>> from sympy import sympify
>>> z1_sympy = sympify(2)  # Normal sympy object
>>> z1_sympy
2
>>> type(z1_sympy)
<class 'sympy.core.numbers.Integer'>
>>> from sympy import Expr
>>> isinstance(z1_sympy, Expr)
True

It is important when working with the domains not to mix sympy expressions with domain elements even though it will sometimes work in simple cases. Each domain object has the methods to_sympy() and from_sympy() for converting back and forth between sympy expressions and domain elements:

>>> z_sympy = sympify(2)
>>> z_zz = ZZ.from_sympy(z_sympy)
>>> z_zz
2
>>> type(z_sympy)
<class 'sympy.core.numbers.Integer'>
>>> type(z_zz)  
<class 'int'>
>>> ZZ.to_sympy(z_zz)
2
>>> type(ZZ.to_sympy(z_zz))
<class 'sympy.core.numbers.Integer'>

Any particular domain will only be able to represent some sympy expressions so conversion will fail if the expression can not be represented in the domain:

>>> from sympy import sqrt
>>> e = sqrt(2)
>>> e
sqrt(2)
>>> ZZ.from_sympy(e)
Traceback (most recent call last):
...
CoercionFailed: expected an integer, got sqrt(2)

We have already seen that in some cases we can use the domain object itself as a constructor e.g. QQ(2). This will generally work provided the arguments given are valid for the dtype of the domain. Although it is convenient to use this in interactive sessions and in demonstrations it is generally better to use the from_sympy() method for constructing domain elements from sympy expressions (or from objects that can be sympified to sympy expressions).

It is important not to mix domain elements with other Python types such as int, float, as well as standard sympy Expr expressions. When working in a domain, care should be taken as some Python operations will do this implicitly. for example the sum function will use the regular int value of zero so that sum([a, b]) is effectively evaluated as (0 + a) + b where 0 is of type int.

Every domain is at least a ring if not a field and as such is guaranteed to have two elements in particular corresponding to \(1\) and \(0\). The domain object provides domain elements for these as the attributes one and zero. These are useful for something like Python’s sum function which allows to provide an alternative object as the “zero”:

>>> ZZ.one
1
>>> ZZ.zero
0
>>> sum([ZZ(1), ZZ(2)])  # don't do this (even it sometimes works)
3
>>> sum([ZZ(1), ZZ(2)], ZZ.zero) # provide the zero from the domain
3

A standard pattern then for performing calculations in a domain is:

Start with sympy Expr instances representing expressions.
Choose an appropriate domain that can represent the expressions.
Convert all expressions to domain elements using from_sympy().
Perform the calculation with the domain elements.
Convert back to Expr with to_sympy().

Here is an implementation of the sum function that illustrates these steps and sums some integers but performs the calculation using the domain elements rather than standard sympy expressions:

def sum_domain(expressions_sympy):
    """Sum sympy expressions but performing calculations in domain ZZ"""

    # Convert to domain
    expressions_dom = [ZZ.from_sympy(e) for e in expressions_sympy]

    # Perform calculations in the domain
    result_dom = ZZ.zero
    for e_dom in expressions_dom:
        result_dom += e_dom

    # Convert the result back to Expr
    result_sympy = ZZ.to_sympy(result_dom)
    return result_sympy

Gaussian integers and Gaussian rationals¶

The two example domains that we have seen so far are ZZ and QQ representing the integers and the rationals respectively. There are other simple domains such as ZZ_I and QQ_I representing the Gaussian integers and Gaussian rationals. The Gaussian integers are numbers of the form \(a\sqrt{-1} + b\) where \(a\) and \(b\) are integers. The Gaussian rationals are defined similarly except that \(a\) and \(b\) can be rationals. We can use the Gaussian domains like:

>>> from sympy import ZZ_I, QQ_I, I
>>> z = ZZ_I.from_sympy(1 + 2*I)
>>> z
(1 + 2*I)
>>> z**2
(-3 + 4*I)

Note the contrast with the way this calculation works in the tree representation where expand() is needed to get the reduced form:

>>> from sympy import expand, I
>>> z = 1 + 2*I
>>> z**2
(1 + 2*I)**2
>>> expand(z**2)
-3 + 4*I

The ZZ_I and QQ_I domains are implemented by the classes GaussianIntegerRing and GaussianRationalField and their elements by GaussianInteger and GaussianRational respectively. The internal representation for an element of ZZ_I or QQ_I is simply as a pair (a, b) of elements of ZZ or QQ respectively. The domain ZZ_I is a ring with similar properties to ZZ whereas QQ_I is a field much like QQ:

>>> ZZ.is_Field
False
>>> QQ.is_Field
True
>>> ZZ_I.is_Field
False
>>> QQ_I.is_Field
True

Since QQ_I is a field division by nonzero elements is always possible whereas in ZZ_I we have the important concept of the greatest common divisor (GCD):

>>> e1 = QQ_I.from_sympy(1+I)
>>> e2 = QQ_I.from_sympy(2-I/2)
>>> e1/e2
(6/17 + 10/17*I)
>>> ZZ_I.gcd(ZZ_I(5), ZZ_I.from_sympy(1+2*I))
(1 + 2*I)

Finite fields¶

So far we have seen the domains ZZ, QQ, ZZ_I, and QQ_I. There are also domains representing the Finite fields although the implementation of these is incomplete. A finite field GF(p) of prime order can be constructed with FF or GF. A domain for the finite field of prime order \(p\) can be constructed with GF(p):

>>> from sympy import GF
>>> K = GF(5)
>>> two = K(2)
>>> two
2 mod 5
>>> two ** 2
4 mod 5
>>> two ** 3
3 mod 5

There is also FF as an alias for GF (standing for “finite field” and “Galois field” respectively). These are equivalent and both FF(n) and GF(n) will create a domain which is an instance of FiniteField. The associated domain elements will be instances of PythonFiniteField or GMPYFiniteField depending on whether or not gmpy is installed.

Finite fields of order \(p^n\) where \(n \ne 1\) are not implemented. It is possible to use e.g. GF(6) or GF(9) but the resulting domain is not a field. It is just the integers modulo 6 or 9 and therefore has zero divisors and non-invertible elements:

>>> K = GF(6)
>>> K(3) * K(2)
0 mod 6

It would be good to have a proper implementation of prime-power order finite fields but this is not yet available in SymPy (contributions welcome!).

Real and complex fields¶

The fields RR and CC are intended mathematically to correspond to the reals and the complex numbers, \(\mathbb{R}\) and \(\mathbb{C}\) respectively. The implementation of these uses floating point arithmetic. In practice this means that these are the domains that are used to represent expressions containing floats. Elements of RR are instances of the class RealElement and have an mpf tuple which is used to represent a float in mpmath. Elements of CC are instances of ComplexElement and have an mpc tuple which is a pair of mpf tuples representing the real and imaginary parts. See the mpmath docs for more about how floating point numbers are represented:

>>> from sympy import RR, CC
>>> xr = RR(3)
>>> xr
3.0
>>> xr._mpf_
(0, 3, 0, 2)
>>> zc = CC(3+1j)
>>> zc
(3.0 + 1.0j)
>>> zc._mpc_
((0, 3, 0, 2), (0, 1, 0, 1))

The use of approximate floating point arithmetic in these domains comes with all of the usual pitfalls. Many algorithms in the sympy.polys module are fundamentally designed for exact arithmetic making the use of these domains potentially problematic:

>>> RR('0.1') + RR('0.2') == RR('0.3')
False

Since these are implemented using mpmath which is a multiprecision library it is possible to create different domains with different working precisions. The default domains RR and CC use 53 binary digits of precision much like standard double precision floating point which corresponds to approximately 15 decimal digits:

>>> from sympy.polys.domains.realfield import RealField
>>> RR.precision
53
>>> RR.dps
15
>>> RR(1) / RR(3)
0.333333333333333
>>> RR100 = RealField(100)
>>> RR100.precision
100
>>> RR100.dps
29
>>> RR100(1) / RR100(3)
0.33333333333333333333333333333

There is however a bug in the implementation of this so that actually a global precision setting is used by all RealElement. This means that just creating RR100 above has altered the global precision and we will need to restore it in the doctest here:

>>> RR(1) / RR(3)  # wrong result!
0.33333333333333333333333333333
>>> dummy = RealField(53)  # hack to restore precision
>>> RR(1) / RR(3)  # restored
0.333333333333333

(Obviously that should be fixed!)

Algebraic number fields¶

An algebraic extension of the rationals \(\mathbb{Q}\) is known as an algebraic number field and these are implemented in sympy as QQ<a>. The natural syntax for these would be something like QQ(sqrt(2)) however QQ() is already overloaded as the constructor for elements of QQ. These domains are instead created using the algebraic_field() method e.g. QQ.algebraic_field(sqrt(2)). The resulting domain will be an instance of AlgebraicField with elements that are instances of ANP.

The printing support for these is less developed but we can use to_sympy() to take advantage of the corresponding Expr printing support:

>>> K = QQ.algebraic_field(sqrt(2))
>>> K
QQ<sqrt(2)>
>>> b = K.one + K.from_sympy(sqrt(2))
>>> b  
ANP([1, 1], [1, 0, -2], QQ)
>>> K.to_sympy(b)
1 + sqrt(2)
>>> b ** 2  
ANP([2, 3], [1, 0, -2], QQ)
>>> K.to_sympy(b**2)
2*sqrt(2) + 3

The raw printed display immediately shows the internal representation of the elements as ANP instances. The field \(\mathbb{Q}(\sqrt{2})\) consists of numbers of the form \(a\sqrt{2}+b\) where \(a\) and \(b\) are rational numbers. Consequently every number in this field can be represented as a pair (a, b) of elements of QQ. The domain element stores these two in a list and also stores a list representation of the minimal polynomial for the extension element \(\sqrt{2}\). There is a sympy function minpoly() that can compute the minimal polynomial of any algebraic expression over the rationals:

>>> from sympy import minpoly, Symbol
>>> x = Symbol('x')
>>> minpoly(sqrt(2), x)
x**2 - 2

In the dense polynomial representation as a list of coefficients this polynomial is represented as [1, 0, -2] as seen in the ANP display for the elements of QQ<sqrt(2)> above.

It is also possible to create an algebraic number field with multiple generators such as \(\mathbb{Q}(\sqrt{2},\sqrt{3})\):

>>> K = QQ.algebraic_field(sqrt(2), sqrt(3))
>>> K
QQ<sqrt(2) + sqrt(3)>
>>> sqrt2 = K.from_sympy(sqrt(2))
>>> sqrt3 = K.from_sympy(sqrt(3))
>>> p = (K.one + sqrt2) * (K.one + sqrt3)
>>> p  
ANP([1/2, 1, -3/2], [1, 0, -10, 0, 1], QQ)
>>> K.to_sympy(p)
1 + sqrt(2) + sqrt(3) + sqrt(6)
>>> K.to_sympy(p**2)
4*sqrt(6) + 6*sqrt(3) + 8*sqrt(2) + 12

Here the algebraic extension \(\mathbb{Q}(\sqrt{2},\sqrt{3})\) is converted to the (isomorphic) \(\mathbb{Q}(\sqrt{2}+\sqrt{3})\) with a single generator \(\sqrt{2}+\sqrt{3}\). It is always possible to find a single generator like this due to the primitive element theorem. There is a sympy function primitive_element() that can compute the minimal polynomial for a primitive element of an extension:

>>> from sympy import primitive_element, minpoly
>>> e = primitive_element([sqrt(2), sqrt(3)], x)
>>> e[0]
x**4 - 10*x**2 + 1
>>> e[0].subs(x, sqrt(2) + sqrt(3)).expand()
0

The minimal polynomial x**4 - 10*x**2 + 1 has the dense list representation [1, 0, -10, 0, 1] as seen in the ANP output above. What the primitive element theorem means is that all algebraic number fields can be represented as an extension of the rationals by a single generator with some minimal polynomial. Calculations over the algebraic number field only need to take advantage of the minimal polynomial and that makes it possible to compute all arithmetic operations and also to carry out higher level operations like factorisation of polynomials.

Polynomial ring domains¶

There are also domains implemented to represent a polynomial ring like K[x] which is the domain of polynomials in the generator x with coefficients over another domain K:

>>> from sympy import ZZ, symbols
>>> x = symbols('x')
>>> K = ZZ[x]
>>> K
ZZ[x]
>>> x_dom = K(x)
>>> x_dom + K.one
x + 1

All the operations discussed before will work with elements of a polynomial ring:

>>> p = x_dom + K.one
>>> p
x + 1
>>> p + p
2*x + 2
>>> p - p
0
>>> p * p
x**2 + 2*x + 1
>>> p ** 3
x**3 + 3*x**2 + 3*x + 1
>>> K.exquo(x_dom**2 - K.one, x_dom - K.one)
x + 1

The internal representation of elements of K[x] is different from the way that ordinary sympy (Expr) expressions are represented. The Expr representation of any expression is as a tree e.g.:

>>> from sympy import srepr
>>> K = ZZ[x]
>>> p_expr = x**2 + 2*x + 1
>>> p_expr
x**2 + 2*x + 1
>>> srepr(p_expr)
"Add(Pow(Symbol('x'), Integer(2)), Mul(Integer(2), Symbol('x')), Integer(1))"

Here the expression is a tree where the top node is an Add and its children nodes are Pow etc. This tree representation makes it possible to represent equivalent expressions in different ways e.g.:

>>> x = symbols('x')
>>> p_expr = x*(x + 1) + x
>>> p_expr
x*(x + 1) + x
>>> p_expr.expand()
x**2 + 2*x

By contrast the domain ZZ[x] represents only polynomials and does so by simply storing the non-zero coefficients of the expanded polynomial (the “sparse” polynomial representation). In particular elements of ZZ[x] are represented as a Python dict. Their type is PolyElement which is a subclass of dict. Converting to a normal dict shows the internal representation:

>>> x = symbols('x')
>>> K = ZZ[x]
>>> x_dom = K(x)
>>> p_dom = K(3)*x_dom**2 + K(2)*x_dom + K(7)
>>> p_dom
3*x**2 + 2*x + 7
>>> dict(p_dom)
{(0,): 7, (1,): 2, (2,): 3}

This internal form makes it impossible to represent unexpanded multiplications so any multiplication of elements of ZZ[x] will always be expanded:

>>> x = symbols('x')
>>> K = ZZ[x]
>>> x_dom = K(x)
>>> p_expr = x * (x + 1) + x
>>> p_expr
x*(x + 1) + x
>>> p_dom = x_dom * (x_dom + K.one) + x_dom
>>> p_dom
x**2 + 2*x

These same considerations apply to powers:

>>> (x + 1) ** 2
(x + 1)**2
>>> (x_dom + K.one) ** 2
x**2 + 2*x + 1

We can also construct multivariate polynomial rings:

>>> x, y = symbols('x, y')
>>> K = ZZ[x,y]
>>> xk = K(x)
>>> yk = K(y)
>>> xk**2*yk + xk + yk
x**2*y + x + y

It is also possible to construct nested polynomial rings (although it is less efficient). The ring K[x][y] is formally equivalent to K[x,y] although their implementations in sympy are different:

>>> K = ZZ[x][y]
>>> p = K(x**2 + x*y + y**2)
>>> p
y**2 + x*y + x**2
>>> dict(p)
{(0,): x**2, (1,): x, (2,): 1}

Here the coefficients like x**2 are instances of PolyElement as well so this is a dict where the values are also dicts. The full representation is more like:

>>> {k: dict(v) for k, v in p.items()}
{(0,): {(2,): 1}, (1,): {(1,): 1}, (2,): {(0,): 1}}

The multivariate ring domain ZZ[x,y] has a more efficient representation as a single flattened dict:

>>> K = ZZ[x,y]
>>> p = K(x**2 + x*y + y**2)
>>> p
x**2 + x*y + y**2
>>> dict(p)
{(0, 2): 1, (1, 1): 1, (2, 0): 1}

The difference in efficiency between these representations grows as the number of generators increases i.e. ZZ[x,y,z,t,...] vs ZZ[x][y][z][t]....

Old (dense) polynomial rings¶

In the last section we saw that the domain representation of a polynomial ring like K[x] uses a sparse representation of a polynomial as a dict mapping monomial exponents to coefficients. There is also an older version of K[x] that uses the dense DMP representation. We can create these two versions of K[x] using poly_ring() and old_poly_ring() where the syntax K[x] is equivalent to K.poly_ring(x):

>>> K1 = ZZ.poly_ring(x)
>>> K2 = ZZ.old_poly_ring(x)
>>> K1
ZZ[x]
>>> K2
ZZ[x]
>>> K1 == ZZ[x]
True
>>> K2 == ZZ[x]
False
>>> p1 = K1.from_sympy(x**2 + 1)
>>> p2 = K2.from_sympy(x**2 + 1)
>>> p1
x**2 + 1
>>> p2
x**2 + 1
>>> type(K1)
<class 'sympy.polys.domains.polynomialring.PolynomialRing'>
>>> type(p1)
<class 'sympy.polys.rings.PolyElement'>
>>> type(K2)
<class 'sympy.polys.domains.old_polynomialring.GlobalPolynomialRing'>
>>> type(p2)
<class 'sympy.polys.polyclasses.DMP'>

The internal representation of the old polynomial ring domain is the DMP representation as a list of (lists of) coefficients:

>>> repr(p2)  
'DMP([1, 0, 1], ZZ, ZZ[x])'

The most notable use of the DMP representation of polynomials is as the internal representation used by Poly (this is discussed later in this page of the docs).

PolyRing vs PolynomialRing¶

You might just want to perform calculations in some particular polynomial ring without being concerned with implementing something that works for arbitrary domains. In that case you can construct the ring more directly with the ring() function:

>>> from sympy import ring
>>> K, xr, yr = ring([x, y], ZZ)
>>> K
Polynomial ring in x, y over ZZ with lex order
>>> xr**2 - yr**2
x**2 - y**2
>>> (xr**2 - yr**2) // (xr - yr)
x + y

The object K here represents the ring and is an instance of PolyRing but is not a polys domain (it is not an instance of a subclass of Domain so it can not be used with Poly). In this way the implementation of polynomial rings that is used in the domain system can be used independently of the domain system.

The purpose of the domain system is to provide a unified interface for working with and converting between different representations of expressions. To make the PolyRing implementation usable in that context the PolynomialRing class is a wrapper around the PolyRing class that provides the interface expected in the domain system. That makes this implementation of polynomial rings usable as part of the broader codebase that is designed to work with expressions from different domains. The domain for polynomial rings is a distinct object from the ring returned by ring() although both have the same elements:

>>> K, xr, yr = ring([x, y], ZZ)
>>> K
Polynomial ring in x, y over ZZ with lex order
>>> K2 = ZZ[x,y]
>>> K2
ZZ[x,y]
>>> K2.ring
Polynomial ring in x, y over ZZ with lex order
>>> K2.ring == K
True
>>> K(x+y)
x + y
>>> K2(x+y)
x + y
>>> type(K(x+y))
<class 'sympy.polys.rings.PolyElement'>
>>> type(K2(x+y))
<class 'sympy.polys.rings.PolyElement'>
>>> K(x+y) == K2(x+y)
True

Rational function fields¶

Some domains are classified as fields and others are not. The principal difference between a field and a non-field domain is that in a field it is always possible to divide any element by any nonzero element. It is usually possible to convert any domain to a field that contains that domain with the get_field() method:

>>> from sympy import ZZ, QQ, symbols
>>> x, y = symbols('x, y')
>>> ZZ.is_Field
False
>>> QQ.is_Field
True
>>> QQ[x]
QQ[x]
>>> QQ[x].is_Field
False
>>> QQ[x].get_field()
QQ(x)
>>> QQ[x].get_field().is_Field
True
>>> QQ.frac_field(x)
QQ(x)

This introduces a new kind of domain K(x) representing a rational function field in the generator x over another domain K. It is not possible to construct the domain QQ(x) with the () syntax so the easiest ways to create it are using the domain methods frac_field() (QQ.frac_field(x)) or get_field() (QQ[x].get_field()). The frac_field() method is the more direct approach.

The rational function field K(x) is an instance of RationalField. This domain represents functions of the form \(p(x) / q(x)\) for polynomials \(p\) and \(q\). The domain elements are represented as a pair of polynomials in K[x]:

>>> K = QQ.frac_field(x)
>>> xk = K(x)
>>> f = xk / (K.one + xk**2)
>>> f
x/(x**2 + 1)
>>> f.numer
x
>>> f.denom
x**2 + 1
>>> QQ[x].of_type(f.numer)
True
>>> QQ[x].of_type(f.denom)
True

Cancellation between the numerator and denominator is automatic in this field:

>>> p1 = xk**2 - 1
>>> p2 = xk - 1
>>> p1
x**2 - 1
>>> p2
x - 1
>>> p1 / p2
x + 1

Computing this cancellation can be slow which makes rational function fields potentially slower than polynomial rings or algebraic fields.

Just like in the case of polynomial rings there is both a new (sparse) and old (dense) version of fraction fields:

>>> K1 = QQ.frac_field(x)
>>> K2 = QQ.old_frac_field(x)
>>> K1
QQ(x)
>>> K2
QQ(x)
>>> type(K1)
<class 'sympy.polys.domains.fractionfield.FractionField'>
>>> type(K2)
<class 'sympy.polys.domains.old_fractionfield.FractionField'>

Also just like in the case of polynomials rings the implementation of rational function fields can be used independently of the domain system:

>>> from sympy import field
>>> K, xf, yf = field([x, y], ZZ)
>>> xf / (1 - yf)
-x/(y - 1)

Here K is an instance of FracField rather than RationalField as it would be for the domain ZZ(x,y).

Expression domain¶

The final domain to consider is the “expression domain” which is known as EX. Expressions that can not be represented using the other domains can be always represented using the expression domain. An element of EX is actually just a wrapper around a Expr instance:

>>> from sympy import EX
>>> p = EX.from_sympy(1 + x)
>>> p
EX(x + 1)
>>> type(p)
<class 'sympy.polys.domains.expressiondomain.ExpressionDomain.Expression'>
>>> p.ex
x + 1
>>> type(p.ex)
<class 'sympy.core.add.Add'>

For other domains the domain representation of expressions is usually more efficient than the tree representation used by Expr. In EX the internal representation is Expr so it is clearly not more efficient. The purpose of the EX domain is to be able to wrap up arbitrary expressions in an interface that is consistent with the other domains. The EX domain is used as a fallback when an appropriate domain can not be found. Although this does not offer any particular efficiency it does allow the algorithms that are implemented to work over arbitrary domains to be usable when working with expressions that do not have an appropriate domain representation.

Choosing a domain¶

In the workflow described above the idea is to start with some sympy expressions, choose a domain and convert all the expressions into that domain in order to perform some calculation. The obvious question that arises is how to choose an appropriate domain to represent some sympy expressions. For this there is a function construct_domain() which takes a list of expressions and will choose a domain and convert all of the expressions to that domain:

>>> from sympy import construct_domain, Integer
>>> elements_sympy = [Integer(3), Integer(2)]  # elements as Expr instances
>>> elements_sympy
[3, 2]
>>> K, elements_K = construct_domain(elements_sympy)
>>> K
ZZ
>>> elements_K
[3, 2]
>>> type(elements_sympy[0])
<class 'sympy.core.numbers.Integer'>
>>> type(elements_K[0])  
<class 'int'>

In this example we see that the two integers 3 and 2 can be represented in the domain ZZ. The expressions have been converted to elements of that domain which in this case means the int type rather than instances of Expr. It is not necessary to explicitly create Expr instances when the inputs can be sympified so e.g. construct_domain([3, 2]) would give the same output as above.

Given more complicated inputs construct_domain() will choose more complicated domains:

>>> from sympy import Rational, symbols
>>> x, y = symbols('x, y')
>>> construct_domain([Rational(1, 2), Integer(3)])[0]
QQ
>>> construct_domain([2*x, 3])[0]
ZZ[x]
>>> construct_domain([x/2, 3])[0]
QQ[x]
>>> construct_domain([2/x, 3])[0]
ZZ(x)
>>> construct_domain([x, y])[0]
ZZ[x,y]

If any noninteger rational numbers are found in the inputs then the ground domain will be QQ rather than ZZ. If any symbol is found in the inputs then a PolynomialRing will be created. A multivariate polynomial ring such as QQ[x,y] can also be created if there are multiple symbols in the inputs. If any symbols appear in the denominators then a RationalField like QQ(x) will be created instead.

Some of the domains above are fields and others are (non-field) rings. In some contexts it is necessary to have a field domain so that division is possible and for this construct_domain() has an option field=True which will force the construction of a field domain even if the expressions can all be represented in a non-field ring:

>>> construct_domain([1, 2], field=True)[0]
QQ
>>> construct_domain([2*x, 3], field=True)[0]
ZZ(x)
>>> construct_domain([x/2, 3], field=True)[0]
ZZ(x)
>>> construct_domain([2/x, 3], field=True)[0]
ZZ(x)
>>> construct_domain([x, y], field=True)[0]
ZZ(x,y)

By default construct_domain() will not construct an algebraic extension field and will instead use the EX domain (ExpressionDomain). The keyword argument extension=True can be used to construct an AlgebraicField if the inputs are irrational but algebraic:

>>> from sympy import sqrt
>>> construct_domain([sqrt(2)])[0]
EX
>>> construct_domain([sqrt(2)], extension=True)[0]
QQ<sqrt(2)>
>>> construct_domain([sqrt(2), sqrt(3)], extension=True)[0]
QQ<sqrt(2) + sqrt(3)>

When there are algebraically independent transcendentals in the inputs a PolynomialRing or RationalField will be constructed treating those transcendentals as generators:

>>> from sympy import sin, cos
>>> construct_domain([sin(x), y])[0]
ZZ[y,sin(x)]

However if there is a possibility that the inputs are not algebraically independent then the domain will be EX:

>>> construct_domain([sin(x), cos(x)])[0]
EX

Here sin(x) and cos(x) are not algebraically independent since sin(x)**2 + cos(x)**2 = 1.

Converting elements between different domains¶

It is often useful to combine calculations performed over different domains. However just as it is important to avoid mixing domain elements with normal sympy expressions and other Python types it is also important to avoid mixing elements from different domains. The convert_from() method is used to convert elements from one domain into elements of another domain:

>>> num_zz = ZZ(3)
>>> ZZ.of_type(num_zz)
True
>>> num_qq = QQ.convert_from(num_zz, ZZ)
>>> ZZ.of_type(num_qq)
False
>>> QQ.of_type(num_qq)
True

The convert() method can be called without specifying the source domain as the second argument e.g.:

>>> QQ.convert(ZZ(2))
2

This works because convert() can check the type of ZZ(2) and can try to work out what domain (ZZ) it is an element of. Certain domains like ZZ and QQ are treated as special cases to make this work. Elements of more complicated domains are instances of subclasses of DomainElement which has a parent() method that can identify the domain that the element belongs to. For example in the polynomial ring ZZ[x] we have:

>>> from sympy import ZZ, Symbol
>>> x = Symbol('x')
>>> K = ZZ[x]
>>> K
ZZ[x]
>>> p = K(x) + K.one
>>> p
x + 1
>>> type(p)
<class 'sympy.polys.rings.PolyElement'>
>>> p.parent()
ZZ[x]
>>> p.parent() == K
True

It is more efficient though to call convert_from() with the source domain specified as the second argument:

>>> QQ.convert_from(ZZ(2), ZZ)
2

Unifying domains¶

When we want to combine elements from two different domains and perform mixed calculations with them we need to

Choose a new domain that can represent all elements of both.
Convert all elements to the new domain.
Perform the calculation in the new domain.

The key question arising from point 1. is how to choose a domain that can represent the elements of both domains. For this there is the unify() method:

>>> x1, K1 = ZZ(2), ZZ
>>> y2, K2 = QQ(3, 2), QQ
>>> K1
ZZ
>>> K2
QQ
>>> K3 = K1.unify(K2)
>>> K3
QQ
>>> x3 = K3.convert_from(x1, K1)
>>> y3 = K3.convert_from(y2, K2)
>>> x3 + y3
7/2

The unify() method will find a domain that encompasses both domains so in this example ZZ.unify(QQ) gives QQ because every element of ZZ can be represented as an element of QQ. This means that all inputs (x1 and y2) can be converted to the elements of the common domain K3 (as x3 and y3). Once in the common domain we can safely use arithmetic operations like +. In this example one domain is a superset of the other and we see that K1.unify(K2) == K2 so it is not actually necessary to convert y2. In general though K1.unify(K2) can give a new domain that is not equal to either K1 or K2.

The unify() method understands how to combine different polynomial ring domains and how to unify the base domain:

>>> ZZ[x].unify(ZZ[y])
ZZ[x,y]
>>> ZZ[x,y].unify(ZZ[y])
ZZ[x,y]
>>> ZZ[x].unify(QQ)
QQ[x]

It is also possible to unify algebraic fields and rational function fields as well:

>>> K1 = QQ.algebraic_field(sqrt(2))[x]
>>> K2 = QQ.algebraic_field(sqrt(3))[y]
>>> K1
QQ<sqrt(2)>[x]
>>> K2
QQ<sqrt(3)>[y]
>>> K1.unify(K2)
QQ<sqrt(2) + sqrt(3)>[x,y]
>>> QQ.frac_field(x).unify(ZZ[y])
ZZ(x,y)

Internals of a Poly¶

We are now in a position to understand how the Poly class works internally. This is the public interface of Poly:

>>> from sympy import Poly, symbols, ZZ
>>> x, y, z, t = symbols('x, y, z, t')
>>> p = Poly(x**2 + 1, x, domain=ZZ)
>>> p
Poly(x**2 + 1, x, domain='ZZ')
>>> p.gens
(x,)
>>> p.domain
ZZ
>>> p.all_coeffs()
[1, 0, 1]
>>> p.as_expr()
x**2 + 1

This is the internal implementation of Poly:

>>> d = p.rep  # internal representation of Poly
>>> d
DMP([1, 0, 1], ZZ, None)
>>> d.rep      # internal representation of DMP
[1, 0, 1]
>>> type(d.rep)
<class 'list'>
>>> type(d.rep[0])  
<class 'int'>
>>> d.dom
ZZ

The internal representation of a Poly instance is an instance of DMP which is the class used for domain elements in the old polynomial ring domain old_poly_ring(). This represents the polynomial as a list of coefficients which are themselves elements of a domain and keeps a reference to their domain (ZZ in this example).

Choosing a domain for a Poly¶

If the domain is not specified for the Poly constructor then it is inferred using construct_domain(). Arguments like field=True are passed along to construct_domain():

>>> from sympy import sqrt
>>> Poly(x**2 + 1, x)
Poly(x**2 + 1, x, domain='ZZ')
>>> Poly(x**2 + 1, x, field=True)
Poly(x**2 + 1, x, domain='QQ')
>>> Poly(x**2/2 + 1, x)
Poly(1/2*x**2 + 1, x, domain='QQ')
>>> Poly(x**2 + sqrt(2), x)
Poly(x**2 + sqrt(2), x, domain='EX')
>>> Poly(x**2 + sqrt(2), x, extension=True)
Poly(x**2 + sqrt(2), x, domain='QQ<sqrt(2)>')

It is also possible to use the extension argument to specify generators of an extension even if no extension is required to represent the coefficients although this does not work when using construct_domain() directly. A list of extension elements will be passed to primitive_element() to create an appropriate AlgebraicField domain:

>>> from sympy import construct_domain
>>> Poly(x**2 + 1, x)
Poly(x**2 + 1, x, domain='ZZ')
>>> Poly(x**2 + 1, x, extension=sqrt(2))
Poly(x**2 + 1, x, domain='QQ<sqrt(2)>')
>>> Poly(x**2 + 1, x, extension=[sqrt(2), sqrt(3)])
Poly(x**2 + 1, x, domain='QQ<sqrt(2) + sqrt(3)>')
>>> construct_domain([1, 0, 1], extension=sqrt(2))[0]
ZZ

(Perhaps construct_domain() should do the same as Poly here…)

Choosing generators¶

If there are symbols other than the generators then a polynomial ring or rational function field domain will be created. The domain used for the coefficients in this case is the sparse (“new”) polynomial ring:

>>> p = Poly(x**2*y + z, x)
>>> p
Poly(y*x**2 + z, x, domain='ZZ[y,z]')
>>> p.gens
(x,)
>>> p.domain
ZZ[y,z]
>>> p.domain == ZZ[y,z]
True
>>> p.domain == ZZ.poly_ring(y, z)
True
>>> p.domain == ZZ.old_poly_ring(y, z)
False
>>> p.rep.rep
[y, 0, z]
>>> p.rep.rep[0]
y
>>> type(p.rep.rep[0])
<class 'sympy.polys.rings.PolyElement'>
>>> dict(p.rep.rep[0])
{(1, 0): 1}

What we have here is a strange hybrid of dense and sparse implementations. The Poly instance considers itself to be an univariate polynomial in the generator x but with coefficients from the domain ZZ[y,z]. The internal representation of the Poly is a list of coefficients in the “dense univariate polynomial” (DUP) format. However each coefficient is implemented as a sparse polynomial in y and z.

If we make x, y and z all be generators for the Poly then we get a fully dense DMP list of lists of lists representation:

>>> p = Poly(x**2*y + z, x, y, z)
>>> p
Poly(x**2*y + z, x, y, z, domain='ZZ')
>>> p.rep
DMP([[[1], []], [[]], [[1, 0]]], ZZ, None)
>>> p.rep.rep
[[[1], []], [[]], [[1, 0]]]
>>> p.rep.rep[0][0][0]
1
>>> type(p.rep.rep[0][0][0])  
<class 'int'>

On the other hand we can make a Poly with a fully sparse representation by choosing a generator that is not in the expression at all:

>>> p = Poly(x**2*y + z, t)
>>> p
Poly(x**2*y + z, t, domain='ZZ[x,y,z]')
>>> p.rep
DMP([x**2*y + z], ZZ[x,y,z], None)
>>> p.rep.rep[0]
x**2*y + z
>>> type(p.rep.rep[0])
<class 'sympy.polys.rings.PolyElement'>
>>> dict(p.rep.rep[0])
{(0, 0, 1): 1, (2, 1, 0): 1}

If no generators are provided to the Poly constructor then it will attempt to choose generators so that the expression is polynomial in those. In the common case that the expression is a polynomial expression in some symbols then those symbols will be taken as generators. However other non-symbol expressions can also be taken as generators:

>>> Poly(x**2*y + z)
Poly(x**2*y + z, x, y, z, domain='ZZ')
>>> from sympy import pi, exp
>>> Poly(exp(x) + exp(2*x) + 1)
Poly((exp(x))**2 + (exp(x)) + 1, exp(x), domain='ZZ')
>>> Poly(pi*x)
Poly(x*pi, x, pi, domain='ZZ')
>>> Poly(pi*x, x)
Poly(pi*x, x, domain='ZZ[pi]')

Algebraically dependent generators¶

Taking exp(x) or pi as generators for a Poly or for its polynomial ring domain is mathematically valid because these objects are transcendental and so the ring extension containing them is isomorphic to a polynomial ring. Since x and exp(x) are algebraically independent it is also valid to use both as generators for the same Poly. However some other combinations of generators are invalid such as x and sqrt(x) or sin(x) and cos(x). These examples are invalid because the generators are not algebraically independent (e.g. sqrt(x)**2 = x and sin(x)**2 + cos(x)**2 = 1). The implementation is not able to detect these algebraic relationships though:

>>> from sympy import sin, cos, sqrt
>>> Poly(x*exp(x))      # fine
Poly(x*(exp(x)), x, exp(x), domain='ZZ')
>>> Poly(sin(x)+cos(x)) # not fine
Poly((cos(x)) + (sin(x)), cos(x), sin(x), domain='ZZ')
>>> Poly(x + sqrt(x))   # not fine
Poly(x + (sqrt(x)), x, sqrt(x), domain='ZZ')

Calculations with a Poly such as this are unreliable because zero-testing will not work properly in this implementation:

>>> p1 = Poly(x, x, sqrt(x))
>>> p2 = Poly(sqrt(x), x, sqrt(x))
>>> p1
Poly(x, x, sqrt(x), domain='ZZ')
>>> p2
Poly((sqrt(x)), x, sqrt(x), domain='ZZ')
>>> p3 = p1 - p2**2
>>> p3                  # should be zero...
Poly(x - (sqrt(x))**2, x, sqrt(x), domain='ZZ')
>>> p3.as_expr()
0

This aspect of Poly could be improved by:

Expanding the domain system with new domains that can represent more classes of algebraic extension.
Improving the detection of algebraic dependencies in construct_domain().
Improving the automatic selection of generators.

Examples of the above are that it would be useful to have a domain that can represent more general algebraic extensions (AlgebraicField is only for extensions of QQ). Improving the detection of algebraic dependencies is harder but at least common cases like sin(x) and cos(x) could be handled. When choosing generators it should be possible to recognise that sqrt(x) can be the only generator for x + sqrt(x):

>>> Poly(x + sqrt(x))            # this could be improved!
Poly(x + (sqrt(x)), x, sqrt(x), domain='ZZ')
>>> Poly(x + sqrt(x), sqrt(x))   # this could be improved!
Poly((sqrt(x)) + x, sqrt(x), domain='ZZ[x]')