This is a very good and fair question. I confess that for a long time I thought that there was no way you could gain any insight from a representation that you couldn’t gain from grappling with the group itself directly, simply because all you are doing is considering a homomorphic image of the group. It took a long time to get it through my head that they are actually useful.
The first step in recognizing that was to finally recognize the use of group actions, another instance in which my feeling was that you couldn’t possibly gain any new insight, given that a group action amounts to just an embedding into a symmetric group. Slowly but surely I came to understand that group actions give you a good way to understand a group because it allows you to think about the group as doing something, as opposed to as an abstract entity. You can use the way in which the group acts to understand the group. You can use it to identify subgroups as stabilizers, you can use it to recognize that something is not normal by how the action works, etc. It may take a while, but it turns out that it gives another way to look at the group, and of course the more ways you have to think of and look at the group, the better: you can play one against the other.
Once you recognize that group actions are a way to get into the group, then you turn to actions of a group not on a set, but on a structured set, and in particular actions that respect the structure. That gives you even more handles on the action. And as it happens, linear transformations are some of the best understood and most fruitful instances of “actions on a structured set”. So that trying to understand groups via their actions on vector spaces becomes something to try/want to have.
In addition to this, there is another important philosophy in algebra: you understand an algebraic object better by how it maps and acts on other things than just by staring at the object. So you will have more luck in understanding a ring by looking at its category of modules than just by staring at the ring; and you have more luck in understanding a group by considering homomorphic images of the group and group actions than just by staring at the group.
Once you combine those things, you end up with the idea that you want to add structural handles to the group because it gives you more “ways in”; it helps you see the group “in action” instead of as a static abstract object. And that the best kinds of actions you can look at are the ones that act on sets with lots of structure, and are best understood. (Felix Klein for example wanted to study groups as “systems of symmetries” of some objects/geometry/etc).
Once you land there, linear representations are an obvious thing to try. There is so much structure in the image that you can still transport back to the original group that you end up being able to gain a lot of information about the group by how it acts on vector spaces.
This approach proved useful early on; there are two famous results that were originally proven with representation theory that took a long time to prove without (and that turned out to require a lot of very strong machinery to prove without): I confess I can’t recall the second one, but the one I always remember is Burnside’s $p^{\alpha}q^{\beta}$ theorem, which was not proven without representation theory until after the Odd Order Theorem, and with many of those same ideas and techniques. Isaac’s recent book in Group Theory (not the one on character theory) can be seen as a book-long exposition of developing a lot of those tools with the objective of culminating with that representation-theory-free proof.
So, why does this “new ‘fake’ structure” help us understand the group? Because it gives us more ways to look at the group, more ways to think about the group elements, and a different way to conceptualize the group, not just as a set with an operation, but as an object that is acting on a structured object in well-understood ways.