Relational Operators on Generalized Quantifiers

Charles Kurtz
Central Missouri State University, Syracuse University

The nuances of natural languages remain unappreciated as long as they remain misunderstood. Logicians and linguists may be forgiven for showing annoyance with the quirks and special cases of a natural language during the period when the underlying structure is unknown. When this structure is eventually unearthed, the annoyance can turn to delight, and what previously struck the mind as bizarre and ungainly is imbued with a sense of beauty, elegance and inevitability. At least, this has been my experience, most recently with questions about relational operators on generalized quantifiers.

Generalized quantifiers are the mathematical analogue to natural language determiners. The natural language determiner binds with a noun to form a noun phrase, and the result binds with a verb phrase to form a sentence. For example, consider the following sentences

The natural language determiner in each case is put into italics. Any term that can be substituted for one of the determiners above, without changing the grammatical structure of the sentence, will also be a natural language determiner.

It has been proposed that the logical analogue for such natural language determiners is a generalized quantifier. Lindstrøm came up with a generalization of quantifier terms based on their role in a sentence. His definition had the function take the extension of the subject to a set of acceptable extensions of the predicate. The extension of the subject and predicate he took to be an element of the power set of the universe, so if the generalized quantifier took the subject (a member of the power set of the universe) to a set of acceptable predicates (which was also a member of the power set), then the domain would be the power set of the universe, and the range would be the power set of the power set of the universe.

Historical Definition 1: A binary generalized quantifier is a function from the power set of the universe to the power set of the power set of the universe. I.e. a binary generalized quantifier is a function Qm where Qm : P (M) -> P (P (M)).

I will save the term 'generalized quantifier' for a generalization of the definition above, and when I have need, call the above function a model based quantifier. The sentence 'QAB' will be true when, for an appropriate universe M, B is in Qm(A).

Such a classification will include many functions that do not appear to be quantificational in character, however, so some natural language constraints were suggested by Barwise & Cooper, and by Keenan & Stavi. The constraints are as follows:

Conservativity: B is in Qm(A) iff A&B is in Qm(A).
Extension: If M and N are universes such that M contains N, and A and B are elements of P (N), then B is in Qm(A) iff B is in Qn(A).
Quantity: For A, B, C, D in P (M), if |A&B|=|C&D|, |A-B|=|C-D|, |B-A|=|D-C|,
and |(A&B)*|=|(C&D)*|, where |A| is the number of elements in A, and A* is the complement of A in M,then B is in Qm(A) iff D is in Qm(C).
Permutation Invariance: For any bijection, ¼, from M to N, B is in Qm(A) iff ¼(B) is in Qn( ¼(A)).
Finiteness: |M| < ƒ.

A model-based quantifier which has the properties listed above will be called a BC (Barwise&Cooper) quantifier. Such constraints will rule out some natural language determiners. For example, possession violates Quantity and Permutation Invariance, and the determiners 'only' and 'mostly' violate Conservativity. Finiteness is the most objectionable of the constraints, but it does provide greater simplicity. Under these constraints, the truth of a quantified statement, 'QAB', can be judged by looking at the size of the intersection set, |A&B|, and the size of the difference set, |A-B|.

This leads to an alternative characterization of the BC quantifiers, and a graphical representation due to van Benthem.
Definition 2: The characteristic of a BC quantifier is a function from NxN -> {T,F}.
In general, the quantifier characteristic maps the ordered pair (|A&B|,|A B|) to true or false. For example, All will map ordered pairs to true when, and only when, the second coordinate is zero, while Some will map ordered pairs to true when, and only when, the first coordinate is not zero. Additional quantifiers would include half, which maps the ordered pair (|A&B|,|A-B|) to true when, and only when, |A&B|/(|A&B|+|A-B|) = 1/2. As for the graphical representation, I will use a stylistic variant of van Benthem's tree of numbers, and we will have the following table, with points labeled.

     (0,0)     (1,0)     (2,0)     (3,0)     (4,0)     (5,0)     ...
     (0,1)     (1,1)     (2,1)     (3,1)     (4,1)     (5,1)     ...
     (0,2)     (1,2)     (2,2)     (3,2)     (4,2)     (5,2)     ...
     (0,3)     (1,3)     (2,3)     (3,3)     (4,3)     (5,3)     ...
     (0,4)     (1,4)     (2,4)     (3,4)     (4,4)     (5,4)     ...
     (0,5)     (1,5)     (2,5)     (3,5)     (4,5)     (5,5)     ...
       :       :       :       :       :       :

Remember that All maps any point of the form (m,0) to true, and Some maps any point of the from (m,n) where n‚0 to true. We will mark the points mapped to true with a '+' and the points marked to false with a '-'. This will give us the following tables.
     All                                   Some
 +     +     +     +     + ...               -     +     +     +     + ...
 -     -     -     -     - ...               -     +     +     +     + ...
 -     -     -     -     - ...               -     +     +     +     + ...
 -     -     -     -     - ...               -     +     +     +     + ...
 :     :     :     :     :                    :     :     :     :     :

     Half                                   Three
 -     -     -     -     - ...               -     -     -     +     - ...
 -     +     -     -     - ...               -     -     -     +     - ...
 -     -     +     -     - ...               -     -     -     +     - ...
 -     -     -     +     - ...               -     -     -     +     - ...
 :     :     :     :     :                    :     :     :     :     :

In addition to the graphs, there will be what I call the solution set of the BC quantifier characteristic. For a given BC quantifier characteristic, Q, the solution set will be T (Q), so for example, T (All) = {(m,n) | n = 0}, T (Some) = {(m,n) | m ‚ 0}, T (Half) = {(m,n) | m/(m+n) = 0} and T (Three) = {(m,n) | m = 3}.

The BC quantifier characteristics will be closed under the normal Boolean connectives, but what I have been studying lately has been the role of relational operators on the BC quantifier characteristics. In natural language the relational operators are 'at least', 'at most', 'more than' and 'less than'. In terms of the generalized quantifiers they can be quite difficult to define. My goal of late has been to define these relational operators as a global function carrying BC quantifier characteristics to BC quantifier characteristics in a manner that accords with natural language usage. So, for 'at least', the function should give the following tables.

     At least Half                    at least Three
 -     +     +     +     + ...               -     -     -     +     + ...
 -     +     +     +     + ...               -     -     -     +     + ...
 -     -     +     +     + ...               -     -     -     +     + ...
 -     -     -     +     + ...               -     -     -     +     + ...
 :     :     :     :     :                    :     :     :     :     :

In addition, the common reading of 'at least all but one' is 'all but at most one', and the typical reading of 'at least one of the three' requires that the size of the subject set be three. Their tables follow.

     at least all but one           at least one of the three
 +     +     +     +     + ...               -     -     -     +      - ...
 +     +     +     +     + ...               -     -     +     -     - ...
 -     -     -     -     - ...               -     +     -     -     - ...
 -     -     -     -     - ...               -     -     -     -     - ...
 :     :     :     :     :                    :     :     :     :     :

With these as hints, all that remains is to find a pattern. One common suggestion is that for 'at least', from each '+' you fill both up and to the right.

Hypothesis 1: T (at least Q) = {(m,n)|(m',n') in T (Q) and m >= m' and n >= n' .

This will work for many of the BC quantifier characteristics, though it does not obviously apply to the operator on the partitive 'at least one of the three'.
This difficulty could be solved by saying that in the case of the partitive, the operator has minimal scope, thus distinguishing between a typical determiner, and one that is never used, as indicated below.

 At least (one of the three)     (at least one) of the three
 -     +     +     +     + ...               -     -     -     +     - ...
 -     +     +     +     + ...               -     -     +     -     - ...
 -     +     +     +     + ...               -     +     -     -     - ...
 -     -     -     -     - ...               -     -     -     -     - ...
 :     :     :     :     :                    :     :     :     :     :

That is, if we were to apply the relational operator to the partitive, we would get the table on the left, but the relational operator is never used in that way. Instead, the apparent use is explained by using the table on the right.

This is not the end of our difficulties. Consider the BC quantifier characteristic

     Three of five
 -     -     -     -     -     -     -     -     -     -     - ...
 -     -     -     -     -     -     -     -     -     -     - ...
 -     -     -     +     -     -     -     -     -     -     - ...
 -     -     -     -     -     -     -     -     -     -     - ...
 -     -     -     -     -     -     +     -     -     -     - ...
 -     -     -     -     -     -     -     -     -     -     - ...
 -     -     -     -     -     -     -     -     -     +     - ...
 -     -     -     -     -     -     -     -     -     -     - ...
 :     :     :     :     :     :     :     :     :     :     :

If we fill up and to the right from the existing '+'s, we will get the following result.

     Up-right closure of three of five
 -     -     -     +     +     +     +     +     +     +     + ...
 -     -     -     +     +     +     +     +     +     +     + ...
 -     -     -     +     +     +     +     +     +     +     + ...
 -     -     -     -     -     -     +     +     +     +     + ...
 -     -     -     -     -     -     +     +     +     +     + ...
 -     -     -     -     -     -     -     -     -     +     + ...
 -     -     -     -     -     -     -     -     -     +     + ...
 -     -     -     -     -     -     -     -     -     -     - ...
 :     :     :     :     :     :     :     :     :     :     :

This is clearly not the table for at least one of the three, which follows with the changes blinking.

     At least three of five
 -     +     +     +     +     +     +     +     +     +     + ...
 -     -     +     +     +     +     +     +     +     +     + ...
 -     -     -     +     +     +     +     +     +     +     + ...
 -     -     -     -     -     +     +     +     +     +     + ...
 -     -     -     -     -     -     +     +     +     +     + ...
 -     -     -     -     -     -     -     -     +     +     + ...
 -     -     -     -     -     -     -     -     -     +     + ...
 -     -     -     -     -     -     -     -     -     -     - ...
 :     :     :     :     :     :     :     :     :     :     :


The addition of the points (1,0) and (2,0) is intriguing, but depends on whether the point (0,0) belongs in the solution set for Three of five, a point which is debatable. The addition of the other points is more problematic. Consider, for example, the point (5,3). We want to include this point since 5/(5+3) = .625 >= .6 = 3/5. Unfortunately, this point is not above and to the right of any of the points in T (Three of five).

In this case, since T (Three of five) = {(m,n) | m/(m+n) = 3/5}, the temptation is great to say that T (At least three of five) = {(m,n) | m/(m+n) >= 3/5}. I believe this move to be illegitimate, however, since there are many ways of expressing the solution set. In particular,
T (Three of five) = T ((Three of the five) or (60% of the at least 10)).
This makes them the same BC quantifier characteristic, so relational operators should work equivalently on either formulation. Given that there are different formulas for picking out the same solution set, how can we justify using the formula m/(m+n) >= 3/5?

An additional problem involves a property on relational operators that I would like to be true. It seems intuitively true that 'at least' distributes over disjunctions, that is that
T (>=(Q1 or Q2)) = T ((>=Q1) or (>=Q2))
If this is the case, however, then we would have the following table for

 >=((Three of the five) or (60% of the at least 10)).
 -     -     -     +     +     +     +     +     +     +     +     +     ...
 -     -     -     +     +     +     +     +     +     +     +     +     ...
 -     -     -     +     +     +     +     +     +     +     +     +     ...
 -     -     -     -     -     -     +     +     +     +     +     +     ...
 -     -     -     -     -     -     +     +     +     +     +     +     ...
 -     -     -     -     -     -     -     -     +     +     +     +     ...
 -     -     -     -     -     -     -     -     -     +     +     +     ...
 :     :     :     :     :     :     :     :     :     :     :     :

Again, there is the troubling behavior near the origin, but more troubling still is that (5,3) will not be in the solution set.

To make the problem more vivid, consider the BC quantifier characteristic for at least ¼/4. While it may seem strange to suggest such a characteristic, involving an irrational proportion, there are uses for it. Suppose that we inscribe a square with a circle, and drop ball bearings in a random distribution on the square. We then check to see if at least ¼/4 of the ball bearings fell within the circle. This seems to make sense, and I can give the table for this quantifier.

 >=(¼-4ths)
 -     +     +     +     +     +     +     +     +     +     +     +  ...
 -     -     -     -     +     +     +     +     +     +     +     +  ...
 -     -     -     -     -     -     -     -     +     +     +     +  ...
 -     -     -     -     -     -     -     -     -     -     -     +  ...
 -     -     -     -     -     -     -     -     -     -     -     -  ...
 -     -     -     -     -     -     -     -     -     -     -     -  ...
 -     -     -     -     -     -     -     -     -     -     -     -  ...
 :     :     :     :     :     :     :     :     :     :     :     :

Here we have an acceptable BC quantifier characteristic, but it does not seem to arise from anything approaching an up-right closure of a BC quantifier characteristic since the solution set for the BC quantifier characteristic ¼-4ths would be the empty set. What is the BC quantifier characteristic which at least maps to at least ¼-4ths?

These were the problems I found with attempting to give a formulation for a relational operator on BC quantifiers. First, relational operators did not apply naturally to partitives. Second, the solution set could be given by different mathematical formula s, so the relations could not focus on any one of the formulas. Lastly, there are BC quantifiers that use the relational operators which have no apparent pre-image.

At the same time that I was grappling with these problems, I was also trying to find a technique for dealing with natural language determiners for mass nouns. These occur in such sentences as:

My solution was to treat generalized quantifiers as functions from measure spaces to measured generalized quantifiers, i.e. Q:(M,A,µ) -> Q(M,A,µ). One class of measure space is (M,P (M),|€|), where the algebra is the power set of the universe, and the measure is the count measure. A measured generalized quantifier of the form Q(M,P (M),|€|) would be a model-based quantifier, and if it fit the natural language constraints mentioned above, it would be a BC quantifier.

Generalized quantifiers which map measure spaces to measured generalized quantifiers which fit the natural constraints given above are called natural language (NL) quantifiers, and as with the BC quantifiers, there is a characteristic function for NL quantifiers.

€Definition 3: The characteristic of a NL quantifier is a function from R*xR* -> {T,F}.

Where R* stands for the non-negative reals. Instead of using a table to symbolize the solution sets, I will use a graph, with Cartesian style coordinates. Instead of putting '+'s on the graph, I will use points, lines and shaded areas to show what points are mapped to true. Some examples of graphs of NL quantifier characteristics are given below.

     two and a half        ¼-4ths           all but 20


Now, instead of the previously suggested up-right closure, there is a simpler way to define the relational operators on NL quantifiers. What I suggest is the following definition.

€definition 4: T (>=Q) = {(m,n) | (m',n') is in T (Q) s.t. m+n = m'+n' & m >= m'}.

What this amounts to is including every point that is above, and on the same diagonal as, a point which the characteristic maps to true. I will call this the closure of the graph up the count diagonal.

Let us return to the previous cases in which relational operators on BC quantifiers caused problems. First the partitive one of the three.

 one of the three  at least one of the three


Note that the points with integral coefficients match the graph that we expected for the natural language determiner 'at least one of the three'.

Next, the case where there are different functions which pick out the same collection of ordered pairs of integers. In this case, the function differs on the non-integral coefficients, and this can be used to explain the difference on the integral coefficients for the relational operators.

     >=(3 of 5)                 >=((3 of the 5)or(60% of the >=10))


Note that although the graphs for 3 of 5 and 3 of the 5 or 60% of the at least 10 match on the integral coefficients, though their closures up the count-diagonals does not.

Thirdly, the case where the pre-image of the relational operator, has no integral coefficient.

        ¼-4ths               >=(¼-4ths)


While ¼-4ths has no points on its characteristic graph with integral coefficients, its closure up the count-diagonal does.

The literature on generalized quantifiers currently considers only integral values. I think that the current notion of a generalized quantifier can be made richer by using a broader definition, which includes measure spaces other than those using the count measure, and which uses non-integral values. Not only can it be used for situations where mass nouns are used, it can also be used to define the relational operators on BC quantifiers. Even in the case where you are using count-nouns, relational operators work on functions with non-integral coefficients.

For the record, though I have only used the relational operator 'at least' until now, the definition of the other relational operators is as follows.
€ definition 5:
€a) T (at most Q) = {(m,n) | (m',n') is in T (Q) s.t. m+n = m'+n' & m ¾ m'}
€b) T (less than Q)= {(m,n) | (m,n) is in T (at least Q)}
€c) T (more than Q) = {(m,n) | (m,n) is in T (at most Q)}
This definition, as well as definition 4, will work on any natural language quantifier, but will match the intuitions of natural language only in the case where we look at the non-integral coefficients of natural language quantifiers.

Bibliography
€ Barwise, J. and Cooper, R., (1981), 'Generalized Quantifiers and Natural Languages', Linguistics and Philosophy 4 , 159-219
€ van Benthem, J.,(1985), 'Themes from a Workshop', in van Benthem and ter Meulen (eds), Generalized Quantifiers in Natural Language .
€ van Benthem, J.,(1987), 'Towards a Computational Semantics', in P. Gardenfors (ed) Generalized Quantifiers: Linguistic and Logical Approaches .
€ Keenan, E.L. and Stavi, J., (1986) 'A Semantic Characterization of Natural Language Determiners', Linguistics and Philosophy 9 , 253-326
€ Lindstrøm, Per,(1966), 'First Order Predicate Logic with Generalized Quantifiers', Theoria 32, 186-195
€ Westerstahl, D., (1989), 'Quantifiers in Formal and Natural Languages', in Gabbay and Guenther (eds.), Handbook of Philosophical Logic, Volume IV , D. Reidel, Dordrecht 1989, 1-131


send comments to author

cwkurtz@cctr.umkc.edu