A Domain_Independent System for
Sketch Recognition
This
paper primarily introduces a domain-independent sketch recognition system,
which uses direction graph and curvature graph to identify primitive shapes,
uses feature-area verification for double check and post-process to complete
hierarchical output.
Let’s
start from imprecise stroke approximation:
Initial
stroke approximation is basically done with the help of direction graph and
curvature graph.
These
two graphs are obtained by analyzing the sampling points on the original
stroke.
where dn is the direction between point
(n+1) and point n. cn is the curvature between point (n-k) and
(n+k). The nominator is the total direction change between point (n-k) and
point (n+k), and the denominator is the path length between these two points.
Then Direction graph and Curvature graph can be
obtained:
For example,
For the Star Stroke, there are four times change of
direction. So basically, there are four big changes in direction graph.
Feature area is used for stroke verification. It works
like this:
The shaded area is the difference between original
stroke and reference stroke. When the difference is very small, it basically
indicates that the reference shape is exactly the one indicated by the original
stroke.
From now on, we will dive into specific cases on how
to proceed with primitive shape detection:
For vertex detection:
When an input stroke cannot be approximated with any
kind of primitive shape, we will divide the stroke into two sub-strokes. The
dividing point will be the point with highest curvature value in current stroke.
For line detection:
Generally, there are two ways for line detection. The
first one is to check whether the direction curve can be fitted with a
horizontal line in direction graph. The second one is to check whether the
actual coordinates of the stroke can be fitted by a line. Then we can use
feature-area verification for double check, just as shown in figure 3(a)
For curve detection:
If a line with non-zero slope shows in direction
graph, the stroke is likely to be a curve. If the slope is close to 2*pi/N,
then it is likely to be a circle. If succeeds, we hypothesize a candidate
circle which takes the center of the stroke’s bounding box as its
own center and the mean distance between the center and each
stroke point as its radius. If the difference between the stroke and
candidate circle is within the reasonable threshold, we can identify the stroke
as a circle.
For arc detection:
The algorithm for arc recognition is devised as
follows:
(1) Check whether the direction curve can be approximated with a line, and abort if fails.
(2) Find the perpendicular bisector, denoted as M, of the first and last stroke points, calculate the intersection point A between line M and the stroke path, and then find the circle on which the first and last stroke points as well as point A reside. The circle’s center is taken as the center of
the candidate arc.
(3) Take the mean distance from each stroke point to the center as the radius.
(4) Compute the ratio between the corresponding feature area against the candidate arc and that arc’s standard area. Check whether the ratio is less than a predefined threshold, and abort if fails.
(1) Check whether the direction curve can be approximated with a line, and abort if fails.
(2) Find the perpendicular bisector, denoted as M, of the first and last stroke points, calculate the intersection point A between line M and the stroke path, and then find the circle on which the first and last stroke points as well as point A reside. The circle’s center is taken as the center of
the candidate arc.
(3) Take the mean distance from each stroke point to the center as the radius.
(4) Compute the ratio between the corresponding feature area against the candidate arc and that arc’s standard area. Check whether the ratio is less than a predefined threshold, and abort if fails.
Handling
self-intersection Strokes
Given a stroke which cannot be approximated by any of our predefined primitive shapes, if it has no self-intersection point, then follow the original procedure; otherwise, make two copies of that stroke, divide them using different strategies, i.e. from self-intersection points versus from the point with maximum curvature, then compare respective recognition result and choose the better one.
Given a stroke which cannot be approximated by any of our predefined primitive shapes, if it has no self-intersection point, then follow the original procedure; otherwise, make two copies of that stroke, divide them using different strategies, i.e. from self-intersection points versus from the point with maximum curvature, then compare respective recognition result and choose the better one.
Post-Process:
In this stage, we need to do simple relation
retrieval, cleanup and basic object recognition. Then everything is done, we
can output our sketch recognition result.
Bibliography:
http://dl.acm.org/citation.cfm?id=604499
没有评论:
发表评论