The original poster left a comment requesting an Inclusion-Exclusion solution. This is given below. For what it's worth, I regard Inclusion-Exclusion to be an inferior method of attack, for this particular problem.
See this article for an
introduction to Inclusion-Exclusion.
Then, see this answer for an explanation of and justification for the Inclusion-Exclusion formula.
Following the syntax in the 2nd link above, let $~S~$ denote the set of all possible words, that contain no more than $~2~$ vowels.
Let $~S_a~$ denote the subset of $~S~$ that contains exactly one "a".
Let $~S_e~$ denote the subset of $~S~$ that contains exactly one "e".
Then, you want
$$| ~S_a \cup S_e ~| - | ~S_a \cap S_e ~|. \tag1 $$
Further, by inclusion exclusion theory,
$$| ~S_a \cup S_e ~| = | ~S_a ~| + | ~S_e ~| - | ~S_a \cap S_e ~|. \tag2 $$
Putting (1) and (2) above together, the desired computation is
$$| ~S_a ~| + | ~S_e ~| - 2| ~S_a \cap S_e ~|. \tag3 $$
Note
More usual in similar Inclusion-Exclusion problems is to define $~S~$ to equal the set of all words, without any regard for how many vowels there are. Unfortunately, with the more usual definition of $~S,~$ you would have (for example) the subset $~S_a~$ including words with exactly $~1~$ "a", but more than $~1~$ "e".
In such an event, the formula shown in (1) above would not represent the desired computation, so the formula shown in (3) above would also not represent the desired computation (i.e. the answer to the problem). So, to make the inferior Inclusion-Exclusion approach work, I resorted to the somewhat artificial definition of $~S.$
By symmetry,
$$| ~S_a ~| = | ~S_b ~| $$
$$= \binom{4}{1} \times \left\{ ~\left[ ~\binom{3}{1} \times 3^2 ~\right] + \left[ ~\binom{3}{0} \times 3^3 ~\right] ~\right\} = 216.$$
That is, there are $~4~$ choices for where to put the "a" and then two mutually exclusive possibilities:
$~3~$ choices for where to put the "e", and then $~3^2~$ ways of completing the word.
No "e" is used, so there are $~3^3~$ ways of completing the word.
Also,
$$| ~S_a \cap S_b ~| = \binom{4}{1} \times \binom{3}{1} \times 3^2 = 108.$$
That is, there are $~4~$ choices for where to put the "a", then $~3~$ choices for where to put the "e", and then $~3^2~$ ways of completing the word.
So, the final computation is
$$2 \times [ ~216 - 108 ~] = 216.$$