Document image analysis by probabilistic network and circuit diagram extraction
Informatica, Oct, 2005 by Andras Barta, Istvan Vajk
[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII.] (5)
T is an operator that performs an orthogonal linear transformation on the image bases. The parameters of the transformation are stored in the [r.sub.i] parameter vector. The image bases may be parameterized by an [a.sub.i] attribute vector. Since features belong to parameterized feature classes the [a.sub.i] vector is necessary to identify their parameters. This description defines a tree structure. The tree is constructed from its nodes and a library. The library is a list of common, frequently used image bases. Figure 2 illustrates the object tree and the library.
[FIGURE 2 OMITTED]
Visual information is inherently spatially ordered, so the tree is defined to represent these spatial relationships. This transformation has three components, displacement, rotation and scaling. The four parameters of the transformation of node i are placed in a reference vector
[r.sub.i]= [x.sup.r.sub.i] [s.sup.r.sub.i] [[phi].sup.r.sub.i] (6)
where [x.sup.r.sub.i] = [[x.sup.r.sub.i] [y.sup.r.sub.i] is the position of the image element in the coordinate system of its parent node, [s.sup.r.sub.i] is the scaling parameter and [[phi].sup.r.sub.i] is the rotation angle. With the object tree, the object library and the image coordinate system the object can be reconstructed. A picture element or a feature is represented in its own local coordinate system. Since only two-dimensional objects are used therefore the scale factor is the same for both axes. Each image base is defined in a unit coordinate system and stored in the library. When the image of an object is reconstructed the image base is transformed from the library to a new coordinate system, which can be described by the vector; [i.sub.i] = [[x.sub.i] [s.sub.i] [[phi].sub.i]]. This coordinate system is calculated from the [r.sub.i] reference vector of the node and the image coordinate system of the parent node, [i.sub.i-1]. This is a recursive reconstruction that iterates through the tree.
[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII.] (7)
With this reconstruction algorithm the tree representation of an object can be compared against the image.
4.2 Network Parameters
The network structure is determined by the p(y | x) conditional probabilities, where x,y are nodes of the network. Conditionally independent nodes are not connected by edge. In order to define the network the p(y | x) parameters have to be calculated. These parameters can be assessed based on experimental training data. In our case of document processing the network is trained on circuit diagrams. Here, it is assumed that the image bases of an object description are independent. The probability parameters [[theta].sub.i,j] are learned as relative frequencies. It can be shown that the distribution of the [[theta].sub.i,j] parameters is a Dirichlet distribution [22], [4]. The conditional probabilities of the network can be described by
[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII.] (8)
where [n.sub.k] is the number of time node k occurs in the sample data and n = [summation of (L k=1)] [n.sub.k is the sample size. The [GAMMA](x) function for integer values is the factorial function, [GAMMA](x) = (x-1)!. The parameters of the Dirichlet distribution correspond to the physical probabilities and the relative frequencies,
- 5 Rules for Immediate Annuities
- Death in the Family: 12 Things to Do Now
- Dumbest Things You Do With Your Money
- 6 Online Networking Mistakes to Avoid
- 401(k) Mistakes to Avoid
- 5 Economic Scenarios to Keep You Up at Night
- The Real ‘Best Places to Retire’
- Best Credit Cards for You
- 12 Tough Questions to Ask Your Parents
- The Real ‘Best Colleges’
- Home Buyer Tax Credit: How to Cash In
- Why You Shouldn't Bash Cash
- 8 Phony 'Bargains' and Better Alternatives
- Danger: 3 Debit Card Scams to Avoid
- 6 Myths About Gas Mileage
- 29 Fees We Hate Most
- Quick and Easy Ways to Boost Returns
- Best Stocks to Buy Now
- Lower Your Taxes: 10 Moves to Make Now
- New Jobs: 8 Lessons from Real-Life Career Switchers
- The New Job Market: Who Wins and Who Loses?
- Health Care Reform's Public Option: Everything You Need to Know
- Volunteer Work When Unemployed: Should You Work for Free?
- Whose Recovery Is This?
- Long-Term-Care Insurance: 4 Biggest Risks to Avoid
Content provided in partnership with
Most Recent Business Articles
Most Recent Business Publications
Most Popular Business Articles
- 7 tips for effective listening: productive listening does not occur naturally. It requires hard work and practice - Back To Basics - effective listening is a crucial skill for internal auditors
- LIFO vs. FIFO: a return to the basics
- FAS 109: a primer for non-accountants - Financial Accounting Standards Board's "Statement 109: Accounting for Income Taxes"
- Using object-oriented analysis and design over traditional structured analysis and design
- Design a commission plan that drives sales - Sales Commissions


