The way I think and teach about IFT is the following. Largely geometric/visual, so may be not to everybody's liking.
Let us ignore the separation into $(h,k)$ parts. Just consider a point $\mathbf{p}\in\Bbb{R}^{n+m}$. Let $f_1,f_2,\ldots,$ be some functions from a neighborhood of $\mathbf{p}$ to the reals, all vanishing at $\mathbf{p}$. Let us select a finite non-empty subset $S$ of those functions, and let $V(S)$ be the set of common zeros of the functions in $S$. We call a vector $\vec{w}$ a tangent of $V(S)$ at $\mathbf{p}$, if there exists a differentiable path $\gamma:(-a,a)\to V(S)$ for some $a>0$ such that $\gamma(0)=\mathbf{p}$ and $\gamma'(0)=\vec{w}$.
Implicit function theorem states (essentially) that if the gradients $\nabla f(\mathbf{p}), f\in S,$ are linearly independent, then the tangent vectors of $V(S)$ at $\mathbf{p}$ form a linear subspace $L$ of $\Bbb{R}^{n+m}$. Moreover that subspace consists of the vectors orthogonal to the gradients $\nabla f(\mathbf{p}), f\in S$. The tangent space $T(S,\mathbf{p})$ of $V(S)$ at $\mathbf{p}$ is translated to pass through $\mathbf{p}$, so it is the coset of a linear subspace $T(S,\mathbf{p})=L+\mathbf{p}$

In the above figure the point $\mathbf{p}$ is the black dot (barely visible through the semi-opaque surfaces). The set $S$ consists of two functions, and their zero sets are the section of a vertical cylindrical tube (quarter of a tube to be more precise) and that light blue slightly bent surface. The vertical plane and th slanting plane are their respective tangent planes at $\mathbf{p}$. The intersection $V(S)$ is that solid black curve, and its tangent space is that red line that is also the intersection of the two tangent planes.
This brings us to a following one-line summary of IFT:
The tangent space of the intersection is the intersection of the tangent spaces.
What does that got to do with the formula for the partial derivatives of the "coordinates to be solved" with respect to the "coordinates viewed as independent variables"?
Well. When those gradients are linearly independent, then the above description of the tangent space tells that it consists of solutions of an underdetermined system of linear equations. Finding its general solution was done in linear algebra. For example we remember that we can solve a subset of $m$ coordinates from those $m$ linear equations iff the corresponding $m\times m$ matrix consisting of the relevant columns is invertible.
What IFT says is that the same holds locally in the differentiable (but no longer linear) case. Furthermore, all those partial derivatives at $\mathbf{p}$ are coded into the equation of that tangent plane, which can be viewed as the graph of an affine linear function from $\Bbb{R}^n$ to $\Bbb{R}^m$.