What's the shortest distance between two cubic Bézier curves?

Question

This question comes from TeX.SX https://tex.stackexchange.com/questions/183123/whats-the-minimum-distance-between-two-bezier-curves

(From typography; TeX) We are trying to find minimum distance between two glyphs. Glyphs usually consist of one or more cubic Bézier curves, the control points are stored in the font file (OTF, TTF, PFB, DFONT, ...).

We are solving https://tex.stackexchange.com/questions/180510/how-to-get-intersection-points-of-two-glyphs and if we could find minimum distance between two Bézier curves analytically, it would be computationally effective. It would be much better method than bisection method we quite often use in typography. We use iteration over a variable/dimension most of the time.

A mathematician told me that this approach could lead into a horrible system of equations. That's likely reason we use numerical methods instead.

One of my (poor) ideas is to convert Bézier curves to spiral curves (see application in FontForge), but it might lead us into an even worse situation (from mathematical point of view).

My next idea is to split up Bézier curves into smaller parts, but it's probably not improving a thing.

My question is probably duplicate to How can I tell when two cubic Bézier curves intersect? and it is related to Shortest distance between two shapes. In computer science this problem is related to the collision detection.

score 3 · Answer 1 · edited Apr 13 '17 at 12:20

People in the CAD business have been intersecting Bezier curves, finding distances, etc. for decades. See these notes or section 5.6.2 of this book for starters. Also, this question. It always amazes me that people in font world tend to invent their own approaches, instead of using what the CAD folks developed.

You have to solve polynomial equations of moderate degree (4, 5, 6 or so). I wouldn't characterise them as "horrible" -- at least they are polynomials. Numerical methods are used to solve them. The common approaches are:

(1) Discretize (replace the curves with sequences of short straight lines), or

(2) Standard root-finding methods, like Newton-Raphson. These work very well if you can find good starting points, which you usually can. If the two curves are $F(u)$ and $G(v)$, then, to find the values of $u$ and $v$ at their closest points, you have to find the roots of $(F(u)-G(v))\cdot F'(u)=0$ and $(F(u)-G(v))\cdot G'(v)=0$.

(3) Subdivision techniques. You can regard these as either intelligent adaptive discretization, or as root finding by the secant method.

score 2 · Answer 2 · answered Jun 05 '14 at 05:10

This is a hard problem. Here are two possible solutions:

Choose $n$ equally spaced points on each curve. These define $n^2$ distances; choose the smallest of these. For large $n$ this will likely be close to the correct answer.
Pick a point $a_1$ on Bezier curve A. Then choose the closest point $b_1$ on Bezier curve B. This involves finding the root to a fifth degree polynomial, which can only be done approximately in general. Then, find the closest point $a_2$ (to $b_1$) on Bezier curve A using the same method. Then, find the closest point $b_2$ (to $a_2$) on Bezier curve B using the same method. Hopefully this process will converge, and will yield the desired two points.

Also, this recent paper gives an algorithm that it claims is quite good. — vadim123, Jun 05 '14 at 05:19
Algorithm 2 will only find a local minimum, not a global one. — Krzysztof Kosiński, Mar 31 '15 at 16:36

Tatarize · Answer 3 · 2017-09-22T06:41:40.800

A few things to realize. The Newton method will be wrong at times too. You will have points in time when the closest distance is not at any root. That calculating out the roots and checking those will give you the wrong answer. And there's exactly nothing you could do about it.

The naive discretize method is going to be $n^2$ time with how close you want the values, and is going to bog down or be more wrong. But you can improve this a lot. I would strongly suggest variable resolution for that method, where you find the nearest point in the two graphs over k-discrete values. Then if you find that say values at 43 and 22 are the closest, you do the same thing you just did but within the range of 42-44 and 21-23 within the two graphs rather than for the whole graph. So if you want precision at the at 10,000th of the graph, you do not perform 100,000,000 distance checks, you perform 50,000 distance checks, by subdividing the graph into 100 parts, finding the then subdividing the ranges around those closest points into 200 points different distances. You can obviously also speed this up by knowing the order of the graphs and knowing that it can't spontaneously move closer more times than it has roots via the Bezout theorem. Or any of the other ways to find the nearest point to a bezier curve (which is a much easier problem).

As far as the computer science goes, I think clearly the best method is to do bezier subdivision of both curves, finding the bounding boxes that are nearest to each other, while you intelligently stop checking those bounding boxes that can be overtly ruled out. Namely if the closest part of the bounding box is further than the most far elements of the closest known bounding box pair, then that entire part of the graph cannot possibly contain the shortest distance, if we plead total ignorance then the graph could be anywhere in a bounding box. Once we divide the graph a few times, there would be ample resolution that we could easily show examples where even the best case scenario between a pairing is worse than the worst case scenario of another pairing. Thus, we know that pairing cannot possibly have the closest point. Further, if we can show this is true between some bounding box in one graph and all the bounding boxes of the other graph that entire section of the graph is proven invalid.

This method would actually converge fairly quickly since we'd be able to quickly get rid of entire sections of the graph, or in the case of many graphs, entire graphs even without needing to check them beyond a bounding box check. In the case where the two graphs have an intersection, the overlapping bounding boxes would quickly be subject to repeated divisions and finding a worst case scenario of a distance of zero or epsilon. And thus disregard all the rest of the graph unless there exists another intersection.

This also would easily get combined with a bunch of very nice and ultimately clever tricks that can be done to determine distances and overlap within the Axis Aligned Bounding Boxes (AABB) of computational geometry. While it's certainly the case that the graph of a bezier curve categorically is located within the convex hull polygon define by the points, and the bounding box contains more area, it's easier and likely computationally better to just get the bounding box for the curve. Especially when we consider a number of great tricks that can let us locate the nearest rectangle in less than $n$ time. If this sort of thing is mission critical you could certainly realize enough tricks to find all correct answers closer than some arbitrary level of error within imperceptibly short periods of time.

What's the shortest distance between two cubic Bézier curves?

3 Answers3