8

Edited to incorporate suggestions from the comments and responses:

Typically, the definition of a group is as follows:

Definition: If $S$ is a set, $*$ is a binary operation on $S$, and $e \in S$, then $G = (S,e,*)$ is called a group if

(i) $(ab)c = a(bc)$, $\forall a,b,c \in S$ (associativity);

(ii) $\exists e \in S$ such that $ae = a = ea$, $\forall a \in S$ (identity); and

(iii) $\forall a \in S$, $\exists b \in S$ such that $ab = e = ba$ (inverse).

Consider the following definition.

Definition: If $S$ is a set, $*$ is a binary operation on $S$, and $e \in S$, then $G = (S,e,*)$ is called a group if

(i) $(ab)c = a(bc)$, $\forall a,b,c \in S$ (associativity);

(ii) $\exists e \in S$ such that $ae = a$, $\forall a \in S$ (right identity); and

(iii) $\forall a \in S$, $\exists b \in S$ such that $ab = e$ (right inverse).

It an be shown that these axioms imply that every right inverse is a left inverse and that $e$ is a left identity. (Of course, there's nothing special about using right identity and right inverse and that we could also take left identity and left inverse as axiomatic.)

Question 1: In most undergraduate textbooks in abstract algebra I've seen (I realize this is anecdotal), the first definition is used. Is there a reason that authors use the first definition and not a variant of the second one? This seems strange to me given that it is desirable to make definitions as lean as possible.

Question 2: Alternately, are there textbooks that employ the second definition (or a variant thereof)?

Pietro Paparella
  • 3,690
  • 1
  • 21
  • 30
  • 12
    The first definition is nicer. – Angina Seng Oct 14 '20 at 20:57
  • 4
    I agree with that, but note that uniqueness of the inverse is not included in the axioms because it can be derived from them—hence my question. – Pietro Paparella Oct 14 '20 at 21:01
  • 24
    Axioms are not always made in order to be "minimal". It has also many advantages to require more (but not too much) in order to make the definition more understandable. – Dietrich Burde Oct 14 '20 at 21:07
  • 1
    The "with respect to " part is redundant since you already defined $G$ as a pair including "". – Carsten S Oct 15 '20 at 09:29
  • It's worth noting that you can make this even leaner. It's enough to have that the operation is associative and that the $\forall a,\forall b \exists c,d$ such that $ac=b \wedge da=b$. That only gives you "two axioms" and you don't have to define a unit. – DRF Oct 15 '20 at 11:38
  • Small mistake in the above unless the structure is finite you also need a $!$ after that exists. That is there exist unique c and d, is I believe, necessary – DRF Oct 15 '20 at 12:48
  • 5
    I prefer the explicit symmetry of the first definition. To me, that’s more important than leanness. – Randall Oct 15 '20 at 17:05
  • 1
    In addition to the other helpful comments, I think that the 'symmetric' definition maps naturally with the features that people already know from previous math courses:
    • numbers with addition, 0

    • nonzero numbers with multiplication, 1

    • nonsingular matrices, multiplication, I

    There's no sense breaking brains up front with "left and right identities could be different!" until students develop some intuition.

    – nilpotence Oct 16 '20 at 02:37
  • 1
    @DRF: I don't think you need to require uniqueness of the solutions (any more than you need to require uniqueness of the identity). Assuming solutions exist, any one solution to $ax=a$ can be shown to also satisfy $bx=b$ for all $b$ (by finding $y$ with $ya=b$ so $bx = (ya)x = y(ax) = b$) giving a right identity, and then you have solutions to $ac=x$ for all $a$ giving right inverses, and you are done. You do not need to require uniqueness. – Arturo Magidin Oct 16 '20 at 16:28
  • @ArturoMagidin Yes you are quite correct. I knew it worked for finite ones. I thought you needed cancellation but apparently just existence of right and left division is enough. – DRF Oct 26 '20 at 14:02
  • I have mentioned some books with the "leaner" definition in this answer: Right identity and Right inverse implies a group. Maybe it is possible to find a few also in other related posts. – Martin Sleziak Sep 26 '21 at 03:50

5 Answers5

19

Question 2: Here are four examples from my bookshelves:

  1. Derek Robinson's A Course in the Theory of Groups, 2nd Edition (Springer, GTM 80), defines a group as a semigroup (nonempty set with an associative binary operation) that has a right identity and right inverses (page 1; he proves they also work on the left in 1.1.2, on page 2).

  2. Marshall Hall, Jr.'s The Theory of Groups (AMS Chelsea Publishing is the version I am looking at). Gives both the two-sided and the one-sided versions as part of the overall definition of a group, indicating the two-sided one is "redundant" and giving the one-sided one.

  3. Burnside, Theory of groups of finite order (I'm looking at Dover's 1959 reprint of the 1911 second edition of the original) defines a group asking only that elements have left inverses. He does not require an identity explicitly, but it follows from the requirement for left inverses.

  4. van der Waerden's Algebra requires only a left identity and left inverses.

Question 1. If you want to fit groups into the more general theory of semigroups, monoids, etc., then note that in a monoid you must specify that the identity is two-sided; you cannot deduce it. So if you want to say something like "A group is a monoid in which every element has inverses", or "A group is a semigroup that has an identity and also has inverses for every element" (this is basically what Bourbaki does), then your specification of identity must be two-sided, in which case having the inverse be defined one-sided looks a little strange.

Also, the definitions match the experience that most undergraduates will have had: at this point, they are probably familiar with the usual numerical examples ($\mathbb{Z}$, $\mathbb{R}$, $\mathbb{Q}$, $\mathbb{C}$, maybe the quaternions, possibly the integers modulo $n$), and possibly also matrices. So it's best to give a definition that matches expectations even if it is a bit more involved/superfluous, than one that is formally more inclusive (by putting fewer conditions on the object) but which may seem to invite queries when matched with the usual examples. Especially when one almost immediately would show that they are in fact two-sided.

Arturo Magidin
  • 417,286
14

Technically speaking, neither of your definitions is correct, because (iii) refers to an undefined $e$. From a rigorous point of view, you have 2 options:

Option 1

$(G,*,e)$ is a group iff $*$ is a binary operation on $G$ and $e∈G$ and:

  1. $∀a,b,c\ ( \ (a*b)*c = a*(b*c) \ )$.
  2. $∀a\ ( \ a*e = e*a = a \ )$.
  3. $∀a\ ∃b\ ( \ a*b = b*a = e \ )$.

Option 2

$(G,*)$ is a group iff $*$ is a binary operation on $G$ and:

  1. $∀a,b,c\ ( \ (a*b)*c = a*(b*c) \ )$.
  2. $∃e\ ( \ ∀a\ ( \ a*e = e*a = a ∧ ∃b\ ( \ a*b = b*a = e \ ) \ )$.

~ ~ ~

That aside, it is wrong to think that it is at all good to have the 'leanest' definition possible. For example, PL (propositional logic) can be axiomatized by a single axiom sentence schema (e.g. Meredith's as given on wikipedia). If you want to make things worse, use the sheffer stroke (NAND) and no other boolean connectives, just because NAND is functionally-complete.

Another example is PA (First-order Peano Arithmetic). The axiomatization of PA in terms of a discrete ordered semi-ring with induction is far superior to the successor-based axiomatization, simply because it reveals more of the true structure of the intended model $ℕ$ of PA. In fact, the motivation for PA in the first place arose from wanting to axiomatize $ℕ$, so we care only about theories that can prove the basic properties of $ℕ$, and unsurprisingly these basic properties are expressed precisely by the discrete ordered semi-ring axioms plus induction.

user21820
  • 60,745
  • 1
    Ignore the downvoter, because there is nothing wrong with this answer, as any competent logician can confirm... – user21820 Oct 15 '20 at 15:01
  • The definitions seem like they are the same; how is mine different or incorrect? (Feel free to email me.) – Pietro Paparella Oct 18 '20 at 00:53
  • 3
    @PietroPaparella: The $e$ is undefined in your (iii). "$∃e$" does not define $e$, because each variable bound by a quantifier is a dummy variable and it has no meaning outside the scope of the quantifier. If you need more clarification, you can ask me in this chat-room. – user21820 Oct 18 '20 at 03:55
  • 1
    Regarding "$\exists e$", to me the real point is that there is an illegal clash of variables in both definitions in the post: the $e$ occurs first as a given constant $e \in S$; and then it occurs later as a bound variable in axiom (ii) $\exists e \in S$. – Lee Mosher Oct 23 '20 at 18:23
  • @LeeMosher: that was precisely my point... Option 1 fixes the error by freeing the variable in (ii), whereas Option 2 fixes it by binding the variable in the erroneous (iii). – user21820 Oct 23 '20 at 19:35
10

I cannot answer your second question, but I'll try answering your first.

Even if we were to work with the second definition, you can bet one of the first things any textbook author would do would be to prove that the second definition implies the first and then work with that throughout the rest of the text.

Also, the fact that the two definitions are equivalent is not really seen as important, because it is rarely the case that we start with a structure knowing it satisfies the requirements of the 'right-group' definition but unsure whether or not it is a 'true group'.

In addition, a definition like this is likely to cause more confusion to the reader. While it may be superficially leaner, people find symmetries very intuitive. Defining groups in this way makes them seem strange (even more than they already are to a lot of people) and raises a lot of questions along the lines of 'what if this rule is tweaked?' which are perfectly fine questions to ask, but they detract from what most abstract algebra books are about.

Ultimately, I think if this change in definition was introduced in a book, it could give the reader some interesting questions to think about, but will mostly just take up some extra space towards the start of the book exploring things which will be irrelevant to the rest of the book, after which the author would go on ahead using the standard definition anyway. Which is probably why it isn't used much.

(feel free to comment or edit for any corrections or suggestions)

7

This is a partial “frame challenge” answer to your Question 1. You write:

This seems strange to me given that it is desirable to make definitions as lean as possible.

“Leanness” is certainly desirable, other things being equal, but it is not the only desirable criterion for definitions, nor the predominant one. Other important ones include naturalness, comprehensibility, and generalisability; and all these make the standard definition of a group preferable. Looking at a few specific issues:

  • The standard definition is symmetric. The presentations you suggest introduce an artificial asymmetry in the axiomatisation — of course it still follows that the resulting theory is symmetric, but the axiomatisation hides this.

  • In examples of groups, we think of the inverses and units as two-sided, because they are. It’s only natural to split up the left- and right-handed versions in examples where they really do diverge. So the standard presentation better fits how we view examples.

  • As Arturo Magidin’s answer details, the standard definition generalises better when we move to weaker structures — monoids, and so on, where the one-sided and two-sided notions really aren’t equivalent.

These are all a bit subjective, but nonetheless very real and important when choosing a definition.

For comparison, Higman and Neumann (following earlier work of Tarski) showed that groups can be axiomatised using a single operation $x/y$ “division”, and a single axiom: $x / ((((x / x) / y) / z) / (((x / x) / x) / z)) = y$. This is certainly “leaner” than either the standard presentation or your single-sided versions; but is also much less comprehensible, natural, or generalisable. I guess you’d agree that this shouldn’t be given as the primary definition of a group. Your suggestion is certainly much better than this one; but I think most mathematicians would agree that for most purposes, it’s slightly less clear and natural than the standard definition, and so the standard one remains preferable.

2

Deliberately not an answer

I think it's not a bad idea to present the symmetric definition, work with groups a little, and then have an exercise saying "Here's an alternative definition; show that it implies our definition." And that exercise could be followed with a discussion of how/why we choose certain definitions [perhaps cribbed from some of the answers here].

In a course where you're trying to train budding mathematicians, you assign this problem. In a course where you're just trying to teach some algebra (e.g., to folks who need to know it for crystallography), you skip it.

And in a course where you're planning to discuss how to build a hierarchy for the formalization of mathematics and an implementation using computers, like that in Isabelle/HOL, for instance, you can go nuts talking about all the different possibilities, and how each one aids/hinders your development in further aspects of algebra.

John Hughes
  • 100,827
  • 4
  • 86
  • 159