Technology Industry
Industry: Email Alert RSS FeedDetection and classification of defect patterns on semiconductor wafers
IIE Transactions, Dec, 2006 by Chih-Hsuan Wang, Way Kuo, Halima Bensmail
3.3. Principle of the Gaussian EM algorithm
Gaussian-mixture-based models are commonly used as a basis for cluster analysis and Gaussian EM algorithms based on the maximum likelihood or Maximum A Posteriori (MAP) estimations are popular and powerful tools. Bensmail and Celeux (1996) provided a good alternative, called eigenvalue decomposition discriminant analysis and used it to analyze 14 discrimination models. Moreover, Bensmail et al. (1997) studied a stochastic Gaussian clustering approach. The Gaussian EM model assumes that the population of interest consists of G different subpopulations. Observations [x.sub.1], [x.sub.2],..., [x.sub.n] in [R.sup.d] (n is the number of observations andd denotes the input dimension), are assumed to arise from a random vector with a joint probability density of:
Most RecentTechnology Articles
- The Google Manifesto: Dr. Open and Mr. Closed
- RIM Is Getting Too Successful for Its Customers' Good
- Tech Law: Google Loses in France, GPL Suits Target Many, IBM Sued, More
- Microsoft Moves Fast, Already Has Custom XML Patch for Word
- Microsoft Might Get Advantage or Pain from Order To Not Sell Word
- More »
f(x; [theta]) = [G.summation over (k=1)] [p.sub.k][f.sub.k](x; [[theta].sub.k]), (3)
where G is the number of components and [p.sub.k] is the probability that an observation belongs to the kth component. The properties of [p.sub.k] [greater than or equal to] 0 and [[SIGMA].sub.k=1.sup.G] [p.sub.k] = 1 will hold for any observation. If mixture kernel [f.sub.[gamma].sub.i] ([x.sub.i]; [[theta].sub.k]) is MultiVariate Normal (MVN) (where [[theta].sub.k] = ([[mu].sub.k], [[SIGMA].sub.k]) and [[gamma].sub.i] = k if [x.sub.i] belongs to the kth component), the density function based on its mean [[mu].sub.k] and covariance [[SIGMA].sub.k] can be computed as:
[f.sub.k] ([x.sub.i] | [[mu].sub.k], [[SIGMA].sub.k]) = [exp{-[1/2]([x.sub.i] - [[mu].sub.k])[.sup.T] [[SIGMA].sub.k.sup.-1] ([x.sub.i] - [[mu].sub.k])}]/[2[pi][.sup.d/2]|[[SIGMA].sub.k]|[.sup.1/2]] (4)
Due to its geometric properties in a MVN distribution, the covariance matrix can be decomposed as [[SIGMA].sub.k] = [[lambda].sub.k] [D.sub.k] [A.sub.k] [D.sub.k.sup.T] (Bensmail and Celeux, 1996), where the superscript T stands for matrix transpose, [[lambda].sub.k] is the eigenvalue of its covariance (controlling the hyper-volume occupied by cluster k as [[lambda].sub.k.sup.d]|[A.sub.k]|), [D.sub.k] is its corresponding eigenvector (that decides the orientation of the principal component in cluster k) and [A.sub.k] is the diagonal matrix (that decides the shape of the covariance). Under a descending sorting of the covariance ([A.sub.k] = diag{[[alpha].sub.1k],...[[alpha].sub.dk]}, 1 = [[alpha].sub.1k] > [[alpha].sub.2k] > ... [[alpha].sub.dk] > 0), the kth cluster tends to be hyper-spherical if all diagonal elements [[alpha].sub.jk] are of similar magnitude, whereas it appears to be a line if [[alpha].sub.2k] [much less than] 1 = [[alpha].sub.1k] holds. Thus, the Gaussian kernel naturally includes two categories of defect patterns, namely the linear scratch and elliptic zone patterns. In brief, all the geometric features (shape, volume, orientation) of the mixtures are summarized by the covariance matrix [[SIGMA].sub.k]. Common instances include [[SIGMA].sub.k] = [lambda]I (I is the identity matrix), where all clusters are spherical and of the same size; [[SIGMA].sub.k] = [SIGMA] constant across clusters, where all clusters have the same geometry but need not be spherical; and unrestricted [[SIGMA].sub.k] where each cluster may have a different geometry (see Bensmail et al. (1997)). The classical approach maximizes the likelihood as:
[FIGURE 4 OMITTED]
L([theta], [gamma]|x) = [n.[product].[i=l]] [f.sub.[gamma].sub.i] ([x.sub.i]; [[theta].sub.[gamma].sub.i]).
Under the assumption of a MVN distribution, the likelihood then becomes:
L([theta], [gamma]) [proportional] [G.[product].[k=1]] [[product].[i[member of][G.sub.k]]] | [[SIGMA].sub.k]|[.sup.-1/2] exp{-1/2([x.sub.i] - [[mu].sub.k])[.sup.T] x [[SIGMA].sub.k.sup.-1] ([x.sub.i] - [[mu].sub.k])}. (5)
In comparison, the traditional K-means method is equivalent to maximizing the MVN classification likelihood when the covariance matrix is proportional to an identity matrix. Details of the expectation and maximization step (EM algorithm) are now briefly described.
CXO UnpluggedSmart Business interviews on BNET
Brought to you by CBS MoneyWatch.com
- Best- and Worst-Paid College Degrees
- 6 Things You Should Never Do on Twitter or Facebook
- How Much Sleep Do You Really Need?
- 6 Big Myths about Gas Mileage
- 5 Rules for Immediate Annuities
- Death in the Family: 12 Things to Do Now
- Dumbest Things You Do With Your Money
- 6 Online Networking Mistakes to Avoid
- 401(k) Mistakes to Avoid
- 5 Economic Scenarios to Keep You Up at Night
- The Real ‘Best Places to Retire’
- Best Credit Cards for You
- 12 Tough Questions to Ask Your Parents
- The Real ‘Best Colleges’
- Home Buyer Tax Credit: How to Cash In
- Why You Shouldn't Bash Cash
- 8 Phony 'Bargains' and Better Alternatives
- Danger: 3 Debit Card Scams to Avoid
- 6 Myths About Gas Mileage
- 29 Fees We Hate Most
- Quick and Easy Ways to Boost Returns
- Best Stocks to Buy Now
- Lower Your Taxes: 10 Moves to Make Now
- New Jobs: 8 Lessons from Real-Life Career Switchers
- The New Job Market: Who Wins and Who Loses?
- Health Care Reform's Public Option: Everything You Need to Know
- Volunteer Work When Unemployed: Should You Work for Free?
- Whose Recovery Is This?
- Long-Term-Care Insurance: 4 Biggest Risks to Avoid
Content provided in partnership with
Most Recent Business Articles
- Multiple criteria evaluation and optimization of transportation systems
- Multi-criteria analysis procedure for sustainable mobility evaluation in urban areas
- A two-leveled multi-objective symbiotic evolutionary algorithm for the hub and spoke location problem
- Multi-criteria analysis for evaluating the impacts of intelligent speed adaptation
- The development of Taiwan arterial traffic-adaptive signal control system and its field test: a Taiwan experience
Most Recent Business Publications
Most Popular Business Articles
- 7 tips for effective listening: productive listening does not occur naturally. It requires hard work and practice - Back To Basics - effective listening is a crucial skill for internal auditors
- LIFO vs. FIFO: a return to the basics
- FAS 109: a primer for non-accountants - Financial Accounting Standards Board's "Statement 109: Accounting for Income Taxes"
- Too Young to Rent a Car? - 25-years-old the minimum age for car renting - Brief Article
- Design a commission plan that drives sales - Sales Commissions




