JP4448653B2

JP4448653B2 - Data analysis method

Info

Publication number: JP4448653B2
Application number: JP2002353833A
Authority: JP
Inventors: 英隆津田; 英大白井
Original assignee: Fujitsu Semiconductor Ltd
Current assignee: Fujitsu Semiconductor Ltd
Priority date: 2002-12-05
Filing date: 2002-12-05
Publication date: 2010-04-14
Anticipated expiration: 2022-12-05
Also published as: JP2004186562A

Description

【０００１】
【発明の属する技術分野】
この発明は、広く産業界で取り扱われるデータ間の関連を把握し、産業上優位な結果をもたらすための有意性のある結果を抽出するデータ解析方法に関し、さらに、解析対象とするデータ値やその平均値だけに注目していては判別が困難である知識や情報を抽出するデータ解析方法に関する。
【０００２】
特に、２次元のデータ解析を可能とし、たとえば、解析対象としてウェーハの面上に配置されたチップの特性値等のデータと、ウェーハ面上における各チップの２次元の配置位置によりデータ分布特徴量を得て解析対象の２次元のデータ解析を効率的におこなうデータ解析方法に関する。
【０００３】
【従来の技術】
解析対象としてウェーハ上のチップの歩留り解析を例にとって説明する。特に、プロセスデータ解析のように、その解析結果から品質、生産性向上の対策決定のための参考データを得ようとする場合には、その精度、信頼度等が重要であり、これについては本願発明者等により既に出願されている（たとえば、下記特許文献１参照。ほか、特願２００１−１２７５３４）。
【０００４】
歩留り低下要因をできるだけ速やかに見つけて対策を実施するために、装置履歴、試験結果、設計情報、各種測定データ等から歩留りに効いている要因やその要因に効いている別の要因を見つけるためのデータ解析がおこなわれる。
【０００５】
データ解析において、歩留り値のように解析対象となるものを目的変数、目的変数の要因となる装置履歴、試験結果、設計情報、各種測定データ等は説明変数といわれる。その際に各種統計学的手法が適用されるが、そのうちの一つとしてデータマイニングを適用することで、多種大量のデータから判別しにくい価値ある情報や規則性を抽出することができる。
【０００６】
半導体デバイスの不良要因を解析するためには、収集されたデータをより多面的に科学的根拠に基づいて解析し、より多くの有意差を抽出するのが重要である。そのため、従来は計算機システムに蓄積されたオリジナルデータの値やその平均値がよく活用されている。しかし、複雑に絡み合ったオリジナルデータ群から不良要因等を抽出するのが困難な場合もある。そのような場合、ウェーハ面内チップやロット内ウェーハの各種測定結果や歩留り等に関して特徴的なデータ分布が存在すれば、それに基づいて不良データの解析を進めることができる。
【０００７】
半導体デバイスの歩留りに関するデータ解析をおこなうには、歩留り値のように計算機システムに蓄積されているデータ値を用いる他に、これらのデータ値を編集した値、たとえば、ロット内ウェーハ歩留り値の分布特徴を活用することも有用である。本出願人は、ロット内データ分布特徴という１次元の特徴量を対象としてデータ解析をおこなう構成を出願している（特願２００１−３３８１８５）。
【０００８】
図９は、従来のデータ解析時に用いる表示画面を示す図である。従来、ウェーハ面内における２次元のデータ分布特徴の解析は、図示のような表示画面を用いておこなっていた。図示の例は、指定されたロット（複数ロットの指定も可能）のウェーハ面内における歩留りの分布状態をイメージ表示したものである。表示画面１００の左側１００ａには、ウェーハＷの面内を複数枠１０１に分割し、各枠１０１、すなわち、ウェーハＷの同一位置にそれぞれ存在するチップの各歩留りが示されている。
【０００９】
また、表示画面１００の右側上部１００ｂには、半導体製造プロセスの焼き付けショット時における２つのショット位置を例とした場合における歩留りがイメージおよび数値で表示されている。また、表示画面１００の右側中央部１００ｃには、ウェーハＷの中心から所定幅ずつ同じ径方向に輪切り（計８領域）に分割したとき、各分割領域に属するチップの歩留りがイメージおよび数値で表示されている。また、表示画面１００の右側下部１００ｄには、ウェーハＷを縦横および斜め方向（計８領域）に分割したとき、各分割領域に属するチップの歩留りがイメージおよび数値で表示されている。各イメージ内のマークは、大きいほど歩留りが高いことを示している。
【００１０】
次に、図１０は、特性試験で合格とされたチップのウェーハ面内分布を示す図である。図示において不良チップは空白で示されるが、この不良チップの発生状態は、ウェーハＷの面内において一様に分布しているのではなく、特徴があることが示されている。
【００１１】
そして、ウェーハ面内の同一位置に属するチップやウェーハ面内を前述した各分割パターンにしたがい、固定的に分割した各領域について、各領域に属するチップの属性値を集計し、解析者が上記表示画面を用いて視覚的に確認することができる。この他にもウェーハ面内のデータ分布特徴には、より多くの解析にとって有用な情報が存在している。
【００１２】
【特許文献１】
特開２００１−３０６９９９号公報
【００１３】
【発明が解決しようとする課題】
しかしながら、実際にはデータ数値の不連続性や、複数の各領域にまたがる特徴が存在するなど、特に、ウェーハ面内における２次元の特徴量抽出となると、解析者がパターン認知する能力を有していても有効な情報を容易に認識することができなかった。加えて、このような２次元の特徴量を全ロットおよびウェーハについてすべて視覚的に確認する作業は膨大な作業量となった。
【００１４】
したがって、図９に示したパターンを用いるだけに限らず、より汎用的なデータ分布の２次元特徴量を自動的に抽出する技術、および抽出されたデータ分布の２次元の特徴量を解析する方法が求められていた。また、従来、半導体の歩留りのデータ解析では、ウェーハ面内のデータ分布特徴、すなわち２次元の特徴量を用いた解析は、技術者の経験やノウハウが必要とされていた。
【００１５】
この発明は、上記問題点に鑑みてなされたものであって、技術者の経験、ノウハウに依存する割合を減らし２次元のデータ分布特徴量を容易に定義、抽出でき、抽出されたデータ分布特徴量を用いてデータ解析を効率的におこなうことができるデータ解析方法を提供することを目的とする。
【００１６】
【課題を解決するための手段】
上記目的を達成するため、本発明は、解析対象のデータ解析に必要なウェーハ上のチップ歩留まりデータをオリジナルデータの中から抽出するデータ抽出工程と、前記データ抽出工程により抽出されたウェーハ上のチップ歩留まりデータが解析対象の２次元座標上であらかじめ設定された複数の帰属領域のうちのどの帰属領域に該当するかを判定する帰属領域判定工程と、前記データ抽出工程によって抽出されたウェーハ上のチップ歩留まりデータを用いて前記帰属領域判定工程によって判定された帰属領域別に２次元のデータ分布特徴量を抽出する２次元データ分布特徴量抽出工程と、前記２次元データ分布特徴量抽出工程によって抽出された２次元のデータ分布特徴量を用いたデータの解析をおこなうデータ解析工程とを含み、前記２次元データ分布特徴量抽出工程は、前記２次元のデータ分布特徴量を、複数の前記帰属領域を２つの領域に２分割し、分割した２つの領域の代表値を用いた所定の算出式に基づき連続値として算出することで前記２次元のデータ分布特徴量を抽出することを特徴とする。
【００１７】
この発明によれば、解析対象の２次元の特徴量抽出の対象の個々のデータがどの分割領域に属しているかを、個々のデータの２次元座標を抽出し、それが各領域分割パターン毎にどの領域に属するかを判定することにより、連続値として得ることができる。そして、２次元の座標上で特徴量が一様に分布していない場合等、この２次元のデータ分布の特徴量を容易に抽出でき、データ解析を効率的におこなえるようになる。
【００１８】
【発明の実施の形態】
以下に添付図面を参照して、この発明にかかるデータ解析方法の好適な実施の形態を詳細に説明する。この実施の形態では、解析対象の具体例としてウェーハの面内における２次元のデータ分布特徴を抽出し、データ解析する例を説明する。
【００１９】
図１は、本発明の実施の形態にかかるデータ解析方法の実施に供せられる計算機システムのハードウェア構成の一例を示す図である。この計算機システムは、図１に示すように、入力装置１、中央処理装置２、出力装置３および記憶装置４から構成される。
【００２０】
図２は、図１に示す構成の計算機システムにより実現されるデータ解析装置の機能構成の一例を示すブロック図である。このデータ解析装置は、図２に示すように、複数のオリジナルデータを含むデータベース４１からなるオリジナルデータ群４２を有する。このデータベース４１は、図１に示す計算機システムの記憶装置４において構築されている。
【００２１】
また、データ解析装置は、オリジナルデータ群４２の中に存在する１以上のデータの分布特徴を定量的に評価して抽出する（データ分布特徴の抽出）手段２１、抽出した１以上のデータ分布特徴量の中から解析対象とする特徴量を選択する（解析対象特徴量の選択）手段２２、解析対象に選択したデータ分布特徴量を目的変数として回帰木分析手法などによるデータマイニングをおこない、データ分布に潜む特徴や規則性などのルールファイル２４を抽出する（データマイニング）手段２３、抽出した特徴、規則性（ルールのファイル）２４を用いてオリジナルデータの分布特徴を解析する統計解析コンポーネント２５や図表作成コンポーネント２６などの解析ツール群２７を備える。
【００２２】
以上の各手段２１，２２，２３および解析ツール群２７は、それぞれの処理をおこなうためのプログラムを中央処理装置２で実行することにより実現される。抽出されたルールファイル２４は記憶装置４に記憶されるとともに、表示装置や印刷装置などの出力装置３により出力される。意思決定５は解析ツール群２７による解析結果に基づいてなされる。
【００２３】
また、上述したルールファイル２４を抽出する手段２３は、オリジナルデータ群４２の中のオリジナルデータ、データ分布特徴の抽出手段２１により抽出された２次元のデータ分布特徴、または解析ツール群２７による解析結果に対してもデータマイニングをおこなうようになっている。また、解析ツール群２７は、オリジナルデータ群４２の中のオリジナルデータ、データ分布特徴の抽出手段２１により抽出された２次元のデータ分布特徴、または解析ツール群２７の出力結果に対しても解析をおこなうようになっている。また、解析ツール群２７による解析結果は、解析対象特徴量の選択手段２２やオリジナルデータ群４２にフィードバックされる。また、オリジナルデータ群４２には、データ分布特徴の抽出手段２１の出力がフィードバックされる。
【００２４】
ここでデータ分布特徴の抽出手段２１における特徴量の抽出について説明する。前述したロット内のウェーハ歩留り値等の分布特徴は、１次元における特徴量であり、この１次元の特徴量は、このようなロット内のウェーハ歩留り、ウェーハの電気的特性値、あるいは時間変動する歩留り値や電気的特性値等から計算される。特徴量抽出の対象の個々のデータの属性値（ロット内のウェーハ番号、時刻等）を、特徴量抽出のための変数としてそのまま使用する。
【００２５】
これに対し、この発明のデータ解析装置では、２次元の特徴量を抽出する。この２次元の特徴量は、ウェーハ内チップ、チップ群の歩留り、電気的特性値等から、各分割領域毎にまとめて計算される。このため、特徴量抽出の対象の個々のデータがどの分割領域に属しているかの判定が、個々の分割パターンにより必要となる。個々のデータの２次元座標を抽出し、それが各領域分割パターン毎にどの領域に属するかを判定する必要が生じる。
【００２６】
図３は、２次元のデータ分布特徴の抽出手段２１が抽出する分布特徴の定義を示す図である。分布特徴の対象は、ウェーハＷを所定の分割方法で分割した際の各領域の属性値とする。そして、分割タイプは複数種類用意される。図３（ａ）に示す分割タイプは、ウェーハＷの面内を中央と４方に計５分割した例である。図３（ｂ）に示す分割タイプは、ウェーハＷの面内を中心から８等分した例である。図３（ｃ）に示す分割タイプは、ウェーハＷの面内を中心から所定径毎に円形状（円環状）に３分割した例である。上記例に限らず、他にも不良要因を絞り込むのに適した分割タイプの追加および削除が可能である。
【００２７】
後述するデータ解析処理では、各分割タイプそれぞれについて、どの２分割領域およびその組み合わせが他の領域や組み合わせに比べて対象とするデータ値についてどの程度の有意差が見いだせるかを判別するようになっている。
【００２８】
次に、２次元のデータ分布特徴を数値化する手段について説明する。図３（ａ）に示した分割タイプを例に具体的に説明する。属性値の大小を考慮した計５つの領域（ＺＯＮＥ１〜ＺＯＮＥ５）の２分割方法の組み合わせは、３０通り（＝ ₅Ｃ₁＋₅Ｃ₂＋₅Ｃ₃＋₅Ｃ₄）となる。ここで、ｘｉを各領域でのデータ値とし、全５領域を任意の２領域に分割する。図４は、複数領域を２分割した例を示す図である。図示の例では、２分割した領域の一方ａ（データ値がｘｈ）をＺＯＮＥ１，２，３とし、他方ｂ（データ値がｘｌ）をＺＯＮＥ４，５とした場合である。
【００２９】
ここで、
ｘｉ：全領域（ＺＯＮＥ１〜５）のｘｉの平均値
ｘ₁〜₃：一方の領域（ｘｈ：ＺＯＮＥ１，２，３）のｘｉの平均値
ｘ₄〜₅：他方の領域（ｘｌ：ＺＯＮＥ４，５）のｘｉの平均値
とすれば、図４に示す各分割領域ｘｈ，ｘｌの平均値ｘ₁〜₃と、ｘ₄〜₅は、ａ，ｂを変数としてそれぞれ、下記式（１），（２）で表される。
【００３０】
【数１】

【００３１】
となる。ここで、２分割したａ，ｂの領域について、（ａ−ｂ）は、３０の領域分割方法すべてについて得られるので、これを各分割方法についての特徴量として定量的に定義でき、（ａ−ｂ）の値により、ウェーハＷの面上における２次元のデータ分布特徴量が定義できる。ここで、特徴量としては、領域ａ，ｂそれぞれの代表値（たとえば、平均値，中央値，最小値，最大値，標準偏差等）を用い、上記のような領域ａ，ｂの差（ａ−ｂ）、比（ａ：ｂ）を用いることができる。また、いずれの場合でも上記数式等の演算により、値として連続値を得ることができる。
【００３２】
以上の説明は、図３（ａ）に示した分割タイプについての２次元のデータ分布特徴量の算出に関するものであるが、図３（ｂ）、（ｃ）についても同様に上記式（１）、（２）で算出することができる。すなわち、領域の分割タイプが異なっても、差（ａ−ｂ）の値は同様な意味を持つため、異なった領域の分割タイプにより得られた差（ａ−ｂ）の値であっても特徴量としては同等に扱うことができる。したがって、各ウェーハＷについて図３（ａ）〜（ｃ）に示す領域の分割タイプ毎ではなく、総合的にみてどの領域の分割タイプのどの分割方法による特徴が最も顕著であるかを判別することが可能となる。
【００３３】
上記ａ，ｂの領域の差（ａ−ｂ）については、絶対値｜ａ−ｂ｜の値が大きいほど、データ分布特徴が顕著に現れ、判別が容易になる。そして、図３（ａ）〜（ｃ）に示す各領域の分割タイプ毎に得られた２次元の特徴量は、各レコード（ウェーハＷ）毎に定義される。
【００３４】
図５は、各領域の分割タイプ毎に得られた特徴量を示す図である。これらの特徴量は、データ解析時の目的変数として扱われる。図示のように、各ウェーハＷ毎のｎ個のレコード１〜Ｎには、それぞれ分割タイプ毎のｎ個の特徴量０１〜ｎが連続値で定義される。各特徴項目の特徴量はロット単位で求められる。図には、図３（ａ）〜（ｃ）の各分割タイプについて、それぞれ異なる算出法による特徴量が格納される。たとえば、分割タイプａ（図３（ａ）に相当）における特徴３０とは、上述した属性値の大小まで考慮した計５つの領域（ＺＯＮＥ１〜ＺＯＮＥ５）に関する計３０通りの２分割方法で得られた２次元の特徴量（連続値）である。
【００３５】
次に、上記２次元の特徴量を目的変数とした回帰木分析について説明する。回帰木分析は、説明変数を同じにして抽出された各特徴量を順次目的変数に選択して、自動的に回帰木分析をおこない、それによって各特徴量を左右する要因がそれぞれについて抽出される。特に、何を解析すべきかが明確となっていない場合には、考えられるすべての特徴量を抽出し、それらを目的変数として回帰木分析を実行する。その結果、上述したように種々の解析結果が得られるので、その中で最も有意差が大とみなされる項目を歩留り改善のための対策項目の候補とする。このように従来の解析方法では容易に抽出されなかった多くの有意差が効率的に抽出される。
【００３６】
ここで、回帰木分析および評価用統計値リストについて説明する。まず、回帰木分析について簡単に説明する。回帰木分析は、複数の属性を示す説明変数とそれにより影響を受ける目的変数からなるレコードの集合を対象とし、その目的変数に最も影響を与える属性と属性値を判別するものである。データマイニングをおこなってルールファイル２４を抽出する手段２３（回帰木分析エンジン）からはデータの特徴や規則性を示すルールが出力される。
【００３７】
回帰木分析の処理は、各説明変数（属性）のパラメータ値（属性値）に基づいて集合の２分割を繰り返していくことで実現される。その集合分割の際、分割前の目的変数の平方和をＳ０、分割後の２つの集合のそれぞれの目的変数の平方和をＳ１およびＳ２としたとき、式（３）で示すΔＳが最大となるように、分割するレコードの説明変数とそのパラメータ値を求める。
【００３８】
ΔＳ＝Ｓ０−（Ｓ１＋Ｓ２）・・・（３）
【００３９】
ここで得られる説明変数とそのパラメータ値は、回帰木では分岐点に対応している。以降、分割された集合についても同様な処理を繰り返し、説明変数の目的変数に対する影響を調べる。以上が、一般によく知られている回帰木分析の手法であるが、集合分割の明確さをより詳しく把握するために、複数の上位分割候補に関して、ΔＳの他に以下のパラメータ（ａ）〜（ｄ）も回帰木分析結果の定量的な評価として使用する。これらのパラメータは評価用統計値リストとして出力される。
【００４０】
（ａ）Ｓ比：
集合分割による平方和の低減率であり、集合分割により平方和がどの程度低減したかを示すパラメータである。この値が小さいほど集合分割の効果は大きく、集合分割が明確におこなわれているので、有意差が大である。
【００４１】
Ｓ比＝（（Ｓ１＋Ｓ２）／２）／Ｓ０・・・（４）
【００４２】
（ｂ）ｔ値：
回帰木分析エンジンにより集合が２分割されるが、分割された２つの集合の平均（／Ｘ１，／Ｘ２）の差の検定のための値である。ここで、"／"は上線を示す。統計のｔ検定は、分割された集合における目的変数の平均値の有意差を示す基準となる。自由度、すなわちデータ数が同じであるなら、ｔが大きいほど集合が明確に分割されており、有意差が大である。
【００４３】
この際、分割された集合の分散に有意差がない場合には次の（５）式によりｔ値を求め、分割された集合の分散に有意差がある場合には（６）式によりｔ値を求める。ここで、Ｎ１およびＮ２は、それぞれ分割した集合１および集合２の要素数である。また、／Ｘ１および／Ｘ２はそれぞれ分割後の各集合の平均である。Ｓ１およびＳ２は、それぞれ分割後の各集合の目的変数の平方和である。
【００４４】
【数２】

【００４５】
【数３】

【００４６】
（ｃ）分割された集合の目的変数の平均値の差：
この値が大きいほど有意差が大である。
【００４７】
（ｄ）分割された各集合のデータ数：
両者の差が小さいほど異常値（ノイズ）による影響が小である。
【００４８】
そして、図５に示した特徴量が生じた要因をウェーハ検査時等の電気的特性値（たとえば電流値、電圧値、抵抗値）を説明変数として定義し回帰木分析を実行する。図６は、実際に回帰木分析を実行する際の入力データの一例を示す図である。図には、２次元のデータ分布特徴の抽出手段２１により抽出されたデータ分布特徴量をＣＳＶ形式で出力した例が示されている。各特徴量は、レコード毎に独立して求められるので、独立して扱われる。たとえば、図６に示すように、各特徴量はＣＳＶ形式で自動的に出力されるので、各特徴量毎に効率的に有意な解析をおこなうことが可能となる。上記２次元の特徴量を抽出する処理をあらかじめプログラムに組み込んでおくか、あるいは抽出する特徴量を定義したファイルを用意して、そのファイルを読み込み実行することができる。
【００４９】
いずれの特徴量も、ウェーハＷの面内における２次元の分布であり、特徴量の値についても離散値ではなく、その特徴がどの程度の強さであるかという連続値で定義されている。値が連続値であるため、従来のように離散値化による情報の欠落が生じないので、より良好な解析結果が期待される。
【００５０】
そして、目的変数とする２次元の特徴量が複数個０１〜ｎ（図５参照）あるとして、順次回帰木分析を実行する。ｎ個の各特徴量は、それぞれ異なる分割方式（たとえば、図３参照）により得られた２次元の特徴量である。図７は、回帰木分析結果を示す図である。図には目的変数を図６に示す特徴０２とし、説明変数を電流Ｉ１〜Ｉ９、電圧Ｖ１〜Ｖ５、抵抗Ｒ１、Ｒ２として回帰木分析をおこなった結果の例が示されている。図の上部には回帰木分析の結果を示す回帰木図が表され、図の下部にはこの回帰木分析の信頼度情報を表す評価用統計値リスト（Evaluation Data）が表されている。
【００５１】
回帰木分析結果であるこの図７に基づいて、電流Ｉ７が０．６７以下であるか否かが特徴０２の値に最も影響を及ぼしていることが判明する。この図に示す評価用統計値リストには、説明変数を変えたときにある目的変数（特徴０２）に最も影響を及ぼす順番が上下順に図表化して表示されている。
【００５２】
この評価用統計値リストの第２行以下の項目Ｓ比，ｔ値は、第１行目の項目のものに比べて大きく異なっていることから、電流Ｉ７の値の影響が際立っていることが判る。加えて、他の特徴量を目的変数とした回帰木分析結果については、評価用統計値リストの第１行目における項目のＳ比，ｔ値は、第２行以下の項目のものに比べてさほど差があるものはみられなかった。第１候補のものと第２候補のものとでは、それほど差がない。これは、同じ程度の複数の要因があるため、不良対策を施しにくいことを示している。以上によって、仮に、値が特徴０２よりも大きい特徴量（目的変数）が生じたとしても不良の発生要因が明確な特徴０２（電流Ｉ７が０．６７以下であるか否か）を不良対策として選定することが適切であると容易に判明できるようになる。
【００５３】
図８は、本発明にかかるデータ解析方法の概略を示すフローチャートである。以上説明したように、このデータ解析方法が開始されると、まずオリジナルデータ群４２の中から解析の対象とするデータ、たとえばウェーハＷの歩留り値や各種測定値等のデータが選択されて抽出される（ステップＳ１）。つづいて、抽出された個々のデータの帰属領域が判定される（ステップＳ２）。この帰属領域の判定は、ウェーハＷ上における個々のデータの２次元座標を抽出し、各データの座標が図３に示した各領域分割パターン毎にどの領域に属するかを判定する。次に、抽出されたデータに対してウェーハＷの面内における２次元のデータ分布特徴量を抽出する処理がおこなわれる（ステップＳ３）。
【００５４】
そして、解析対象とするデータ分布特徴量が選択され、それを目的変数として回帰木分析等のデータマイニングがおこなわれる（ステップＳ４）。ステップＳ３で抽出されたすべてのデータ分布特徴量について回帰木分析がおこなわれ（ステップＳ５：Ｎｏ〜ステップＳ４のループ）、回帰木分析が終了したら（ステップＳ５：Ｙｅｓ）、分析結果が出力され、技術者はその確認をおこなう（ステップＳ６）。そして、技術者は、分析結果に基づいて意思決定をおこなう（ステップＳ７）。ステップＳ６、ステップＳ７は、図７に示す評価用統計値リストを用いてＳ比の値やｔの値等を用いた上述の評価機能をコンピュータに自動実行させる構成とすることもできる。
【００５５】
上述した実施の形態によれば、解析対象であるウェーハＷのオリジナルデータ群内に存在するウェーハ面内の２次元のデータ分布特徴量を複数、定義および抽出し、これらを定量化する。そして、各データ分布特徴量を順次目的変数に選択して解析をおこなうことにより、不良解析等のデータ解析に有効な統計的有意差を抽出できるようになる。これにより、多種多様なデータであっても、埋もれて判別が困難であった関連性や有意差を、技術者の主観によらずに客観的に、また効率的および定量的に抽出することができるようになる。
【００５６】
なお、上記実施の形態において説明した２次元のデータ分布特徴の抽出は、解析対象であるたとえばウェーハＷを例にとると、最小単位で１枚のウェーハＷに適用でき、１ロット単位あるいは複数ロットを単位とすることができ、オリジナルデータ群から規定した単位毎におこなうことができる。また、この発明は、３次元のデータ分布特徴の抽出についても容易に適用できるものである。
【００５７】
以上説明したデータ解析方法は、あらかじめ用意されたプログラムをパーソナル・コンピュータやワークステーション等のコンピュータで実行することにより実現することができる。このプログラムは、各種記録媒体に記録され、コンピュータによって記録媒体から読み出されることによって実行される。またこのプログラムは、インターネット等のネットワークを介して配布することが可能な伝送媒体であってもよい。
【００５８】
（付記１）解析対象のデータ解析に必要なデータをオリジナルデータの中から抽出するデータ抽出工程と、
前記データ抽出工程により抽出されたデータが解析対象の２次元座標上であらかじめ設定された複数の帰属領域のうちのどの帰属領域に該当するかを判定する帰属領域判定工程と、
前記データ抽出工程によって抽出されたデータを用いて前記帰属領域判定工程によって判定された帰属領域別に２次元のデータ分布特徴量を抽出する２次元データ分布特徴量抽出工程と、
前記２次元データ分布量抽出工程によって抽出された２次元のデータ分布特徴量を用いたデータの解析をおこなうデータ解析工程と、
を含むことを特徴とするデータ解析方法。
【００５９】
（付記２）前記２次元データ分布特徴量抽出工程は、
前記２次元のデータ分布特徴量を前記帰属領域を判定するための算出式に基づき連続値として算出することで前記２次元のデータ分布特徴量を抽出することを特徴とする付記１に記載のデータ解析方法。
【００６０】
（付記３）前記解析対象毎のレコードを設定してファイル化し、各レコードに含まれる複数の前記２次元のデータ分布特徴量が互いに独立して設定されることを特徴とする付記１または２に記載のデータ解析方法。
【００６１】
（付記４）前記データ解析工程は、
前記２次元のデータ分布特徴量を目的変数として、解析すべき２次元のデータの特徴や規則を表すルールをデータマイニングにより抽出することを特徴とする付記１〜３のいずれか一つに記載のデータ解析方法。
【００６２】
（付記５）前記データ解析工程は、
前記抽出された２次元のデータ分布特徴量のうちいずれのデータ分布特徴量が最もデータ解析に有意な結果を与える目的変数であるかを前記データマイニングにより抽出することを特徴とする付記４に記載のデータ解析方法。
【００６３】
（付記６）前記ルールの信頼度を表す情報として、前記データ解析工程により得たデータの集合を２分割する際の分割の明確度を表す集合分割評価値を求める評価工程を含むことを特徴とする付記４または５に記載のデータ解析方法。
【００６４】
（付記７）前記評価工程は、
前記集合分割評価値として次の式で表されるＳ比の値を用いることを特徴とする付記６に記載のデータ解析方法。
Ｓ比＝（（Ｓ１＋Ｓ２）／２）Ｓ０
ただし、Ｓ０は、複数のオリジナルデータよりなる集合を２分割する前の目的変数の平方和、Ｓ１およびＳ２はそれぞれ分割後の各集合の目的変数の平方和である。
【００６５】
（付記８）前記評価工程は、
前記集合分割評価値として次の式で表されるｔの値を用いることを特徴とする付記６に記載のデータ解析方法。
【数４】

【００６６】
（付記９）前記評価工程は、
前記２分割した後の各分割集合に属する前記抽出されたデータの個数比を用いることを特徴とする付記６に記載のデータ解析方法。
【００６７】
（付記１０）前記評価工程は、
前記２分割を繰り返して得られた後の各分割集合が示す前記Ｓ比の値が小さく、かつ、ｔの値が大きい順に説明変数を附した評価用統計値リストを作成することを特徴とする付記６に記載のデータ解析方法。
【００６８】
（付記１１）解析対象の品質の変動を示す目的変数と、該目的変数の変動を説明する説明変数とを含むオリジナルデータのデータ解析をおこなうデータ解析プログラムであって、
前記解析対象のデータ解析に必要なデータをオリジナルデータの中から抽出させるデータ抽出工程と、
前記データ抽出工程によって抽出されたデータが前記解析対象の２次元座標上であらかじめ設定された複数の帰属領域のうちのどの帰属領域に該当するかを判定させる帰属領域判定工程と、
前記データ抽出工程によって抽出されたデータを用いて前記判定された帰属領域別に２次元のデータ分布特徴量を抽出させる２次元データ分布特徴量抽出工程と、
前記２次元データ分布特徴量抽出工程によって抽出された２次元のデータ分布特徴量を用いたデータの解析をおこなわせることを特徴とするデータ解析プログラム。
【００６９】
（付記１２）解析対象のデータ解析に必要なデータをオリジナルデータの中から抽出するデータ抽出手段と、
前記データ抽出手段により抽出されたデータが解析対象の２次元座標上であらかじめ設定された複数の帰属領域のうちのどの帰属領域に該当するかを判定する帰属領域判定手段と、
前記データ抽出手段によって抽出されたデータを用いて前記帰属領域判定手段によって判定された帰属領域別に２次元のデータ分布特徴量を抽出する２次元データ分布特徴量抽出手段と、
前記抽出された２次元のデータ分布特徴量を用いたデータの解析をおこなうデータ解析手段と、
を備えたことを特徴とするデータ解析装置。
【００７０】
（付記１３）前記データ解析手段により解析された２次元のデータの特徴や規則であるルールの信頼度を表す情報として、前記データ解析手段により得たデータの集合を２分割する際の分割の明確度を表す集合分割評価値を求める評価手段を備えたことを特徴とする付記１２に記載のデータ解析装置。
【００７１】
（付記１４）前記解析対象は、半導体用ウェーハの面上に複数配置される各チップであり該各チップの特性値が前記オリジナルデータとして出力され、
前記帰属領域判定手段は、前記ウェーハの面上を複数の領域に分割し、該分割された複数の領域を任意の２領域のいずれかに含ませて前記帰属領域を判定することを特徴とする付記１３に記載のデータ解析装置。
【００７２】
（付記１５）前記２次元データ分布特徴量抽出手段は、
前記任意の２領域にそれぞれ属する各データの代表値を特徴量とし、該２領域の総合値同士の差分、あるいは比を２次元のデータ分布特徴量とすることを特徴とする付記１４に記載のデータ解析装置。
【００７３】
【発明の効果】
本発明によれば、解析対象の２次元の特徴量抽出の対象の個々のデータがどの分割領域に属しているかを、個々のデータの２次元座標を抽出し、それが各領域分割パターン毎にどの領域に属するかを判定し、連続値として得ることができるため、技術者の経験、ノウハウに依存する割合を減らし２次元の特徴量を容易に定義、抽出でき、抽出された特徴量を用いて２次元の座標上でのデータ分布特徴を容易に抽出でき、データ解析を効率的におこなうことができるという効果を奏する。
【図面の簡単な説明】
【図１】本発明の実施の形態にかかるデータ解析方法の実施に供せられる計算機システムのハードウェア構成の一例を示す図である。
【図２】図１に示す構成の計算機システムにより実現されるデータ解析装置の機能構成の一例を示すブロック図である。
【図３】２次元のデータ分布特徴の抽出手段が抽出する分布特徴の定義を示す図である。
【図４】複数領域を２分割した例を示す図である。
【図５】各領域の分割タイプ毎に得られた特徴量を示す図である。
【図６】実際に回帰木分析を実行する際の入力データの一例を示す図である。
【図７】回帰木分析結果を示す図である。
【図８】本発明にかかるデータ解析方法の概略を示すフローチャートである。
【図９】従来のデータ解析時に用いる表示画面を示す図である。
【図１０】特性試験で合格とされたチップのウェーハ面内分布を示す図である。
【符号の説明】
１入力装置
２中央処理装置
３出力装置
４記憶装置
２１データ分布特徴の抽出手段
２２解析対象特徴量の選択手段
２３データマイニング手段
２４特徴、規則性のルールファイル
２５統計解析コンポーネント
２６図表作成コンポーネント
２７解析ツール群
４１データベース
４２オリジナルデータ群[0001]
BACKGROUND OF THE INVENTION
The present invention relates to a data analysis method for extracting a meaningful result for obtaining an industrially superior result by grasping a relation between data widely handled in the industry, and further, a data value to be analyzed and its value The present invention relates to a data analysis method for extracting knowledge and information that are difficult to discriminate by paying attention only to an average value.
[0002]
In particular, two-dimensional data analysis is possible. For example, the data distribution feature amount depends on the data such as the characteristic value of the chip arranged on the wafer surface as the analysis target and the two-dimensional arrangement position of each chip on the wafer surface. The present invention relates to a data analysis method for efficiently performing two-dimensional data analysis of an analysis target.
[0003]
[Prior art]
As an analysis target, description will be made by taking an example of yield analysis of chips on a wafer. In particular, when trying to obtain reference data for determining measures to improve quality and productivity from the analysis results, such as process data analysis, the accuracy, reliability, etc. are important. An application has already been filed by the inventors (see, for example, Patent Document 1 below, and Japanese Patent Application No. 2001-127534).
[0004]
In order to find out the yield reduction factor as quickly as possible and take countermeasures, to find the factor that is effective in the yield and other factors that are effective in that factor from the device history, test results, design information, various measurement data, etc. Data analysis is performed.
[0005]
In data analysis, an analysis target such as a yield value is an objective variable, device history that causes the objective variable, test results, design information, various measurement data, and the like are called explanatory variables. Various statistical methods are applied at that time, and by applying data mining as one of them, it is possible to extract valuable information and regularity that are difficult to discriminate from a large amount of data.
[0006]
In order to analyze a failure factor of a semiconductor device, it is important to analyze the collected data on a multifaceted basis on a scientific basis and extract more significant differences. Therefore, conventionally, the value of the original data stored in the computer system and its average value are often used. However, there are cases where it is difficult to extract a failure factor or the like from a complicatedly intertwined original data group. In such a case, if there is a characteristic data distribution regarding various measurement results, yields, and the like of the chips in the wafer surface and the wafers in the lot, the analysis of the defect data can proceed.
[0007]
In order to perform data analysis on the yield of semiconductor devices, in addition to using the data values stored in the computer system, such as the yield values, values obtained by editing these data values, for example, distribution characteristics of wafer yield values in the lot It is also useful to utilize The present applicant has applied for a configuration for performing data analysis on a one-dimensional feature quantity called a data distribution feature in a lot (Japanese Patent Application No. 2001-338185).
[0008]
FIG. 9 is a diagram showing a display screen used in conventional data analysis. Conventionally, analysis of a two-dimensional data distribution feature in a wafer surface has been performed using a display screen as shown in the figure. In the example shown in the figure, the distribution state of the yield in the wafer surface of a designated lot (designation of a plurality of lots is also possible) is displayed as an image. On the left side 100a of the display screen 100, the surface of the wafer W is divided into a plurality of frames 101, and each frame 101, that is, each yield of chips existing at the same position of the wafer W is shown.
[0009]
Further, on the upper right side 100b of the display screen 100, the yield in the case of two shot positions at the time of burning shot in the semiconductor manufacturing process is displayed as an image and a numerical value. In addition, in the right-side center portion 100c of the display screen 100, when the wafer W is divided into circular cuts (total of 8 regions) in the same radial direction by a predetermined width, the yield of chips belonging to each divided region is displayed as an image and a numerical value. Has been. Further, in the lower right portion 100d of the display screen 100, when the wafer W is divided vertically and horizontally and obliquely (total 8 regions), the yield of chips belonging to each divided region is displayed as an image and a numerical value. The larger the mark in each image, the higher the yield.
[0010]
Next, FIG. 10 is a diagram showing the distribution in the wafer plane of the chips that have passed the characteristic test. In the drawing, defective chips are shown as blank, but the occurrence state of the defective chips is not uniformly distributed in the plane of the wafer W, but has a characteristic.
[0011]
Then, according to each division pattern described above for chips belonging to the same position on the wafer surface or within the wafer surface, the attribute values of the chips belonging to each region are totalized for each fixed division region, and the analyst displays the above It can be visually confirmed using a screen. In addition to this, information useful for more analysis exists in the data distribution characteristics in the wafer surface.
[0012]
[Patent Document 1]
JP 2001-306999 A
[0013]
[Problems to be solved by the invention]
However, in fact, there are discontinuities in the numerical values of the data, and there are features that span multiple areas. Especially, when extracting two-dimensional features in the wafer surface, the analyst has the ability to recognize patterns. However, effective information could not be easily recognized. In addition, the work of visually confirming such two-dimensional feature values for all lots and wafers is a huge amount of work.
[0014]
Therefore, not only using the pattern shown in FIG. 9, but also a technique for automatically extracting a two-dimensional feature value of a more general-purpose data distribution and a method of analyzing a two-dimensional feature value of the extracted data distribution Was demanded. Conventionally, in semiconductor yield data analysis, data distribution characteristics within the wafer surface, that is, analysis using two-dimensional feature values, requires experience and know-how of engineers.
[0015]
The present invention has been made in view of the above-described problems, and can reduce the proportion depending on the experience and know-how of engineers and easily define and extract two-dimensional data distribution feature amounts. An object of the present invention is to provide a data analysis method capable of efficiently performing data analysis using a quantity.
[0016]
[Means for Solving the Problems]
In order to achieve the above object, the present invention is necessary for analyzing data to be analyzed. Chip yield on wafer Data extraction process for extracting data from the original data, and extracted by the data extraction process Chip yield on wafer An attribution region determination step for determining which attribution region corresponds to data among a plurality of attribution regions set in advance on the two-dimensional coordinates to be analyzed, and the data extraction step Chip yield on wafer A two-dimensional data distribution feature amount extraction step for extracting a two-dimensional data distribution feature amount for each belonging region determined by the belonging region determination step using data, and 2 extracted by the two-dimensional data distribution feature amount extraction step A data analysis step of analyzing data using a two-dimensional data distribution feature amount, wherein the two-dimensional data distribution feature amount extraction step is configured to convert the two-dimensional data distribution feature amount into a plurality of attribution regions. The two-dimensional data distribution feature amount is extracted by dividing into two regions and calculating as continuous values based on a predetermined calculation formula using representative values of the two divided regions.
[0017]
According to the present invention, the two-dimensional coordinates of the individual data are extracted to determine which divided area the individual data of the two-dimensional feature extraction target to be analyzed belongs, and for each area division pattern. By determining which region it belongs to, it can be obtained as a continuous value. Then, when the feature amount is not uniformly distributed on the two-dimensional coordinates, the feature amount of the two-dimensional data distribution can be easily extracted, and data analysis can be performed efficiently.
[0018]
DETAILED DESCRIPTION OF THE INVENTION
Exemplary embodiments of a data analysis method according to the present invention will be explained below in detail with reference to the accompanying drawings. In this embodiment, an example in which a two-dimensional data distribution feature in the plane of a wafer is extracted and analyzed as a specific example of an analysis target will be described.
[0019]
FIG. 1 is a diagram illustrating an example of a hardware configuration of a computer system that is provided for carrying out a data analysis method according to an embodiment of the present invention. As shown in FIG. 1, the computer system includes an input device 1, a central processing device 2, an output device 3, and a storage device 4.
[0020]
FIG. 2 is a block diagram showing an example of a functional configuration of the data analysis apparatus realized by the computer system having the configuration shown in FIG. As shown in FIG. 2, the data analyzing apparatus has an original data group 42 including a database 41 including a plurality of original data. The database 41 is constructed in the storage device 4 of the computer system shown in FIG.
[0021]
Further, the data analysis apparatus quantitatively evaluates and extracts the distribution feature of one or more data existing in the original data group 42 (extraction of data distribution feature) 21, and the extracted one or more data distribution features. Means 22 for selecting a feature quantity to be analyzed from the quantity (selection of analysis target feature quantity), data distribution feature quantity selected as the analysis target is used as a target variable, and data mining is performed by a regression tree analysis method, etc. (Data mining) means 23 for extracting a rule file 24 such as features and regularity lurking in the data, a statistical analysis component 25 for analyzing the distribution characteristics of the original data using the extracted features and regularity (rule file) 24, and a chart An analysis tool group 27 such as a creation component 26 is provided.
[0022]
Each of the above means 21, 22, 23 and the analysis tool group 27 is realized by executing a program for performing each processing in the central processing unit 2. The extracted rule file 24 is stored in the storage device 4 and is output by the output device 3 such as a display device or a printing device. Decision making 5 is made based on the analysis result by the analysis tool group 27.
[0023]
The above-described means 23 for extracting the rule file 24 includes the original data in the original data group 42, the two-dimensional data distribution feature extracted by the data distribution feature extraction means 21, or the analysis result by the analysis tool group 27. Data mining is also performed on The analysis tool group 27 also analyzes the original data in the original data group 42, the two-dimensional data distribution feature extracted by the data distribution feature extraction means 21, or the output result of the analysis tool group 27. It is supposed to do. The analysis result by the analysis tool group 27 is fed back to the analysis target feature quantity selection means 22 and the original data group 42. Further, the output of the data distribution feature extraction means 21 is fed back to the original data group 42.
[0024]
Here, extraction of feature amounts in the data distribution feature extraction unit 21 will be described. The distribution feature such as the wafer yield value in the lot described above is a one-dimensional feature value, and this one-dimensional feature value varies with the wafer yield, the electrical characteristic value of the wafer, or the time. It is calculated from the yield value and electrical characteristic value. The attribute values (wafer number in the lot, time, etc.) of the individual data to be extracted are used as variables for extracting the feature values.
[0025]
In contrast, the data analysis apparatus of the present invention extracts a two-dimensional feature amount. This two-dimensional feature amount is calculated for each divided region from the chip in the wafer, the yield of the chip group, the electrical characteristic value, and the like. For this reason, it is necessary to determine in which divided area each piece of data to be subjected to feature quantity extraction belongs to each divided pattern. It is necessary to extract two-dimensional coordinates of individual data and determine to which area each area division pattern belongs.
[0026]
FIG. 3 is a diagram showing the definition of distribution features extracted by the two-dimensional data distribution feature extraction means 21. The target of the distribution feature is an attribute value of each region when the wafer W is divided by a predetermined dividing method. A plurality of division types are prepared. The division type shown in FIG. 3A is an example in which the plane of the wafer W is divided into a total of five in the center and four directions. The division type shown in FIG. 3B is an example in which the plane of the wafer W is equally divided into eight from the center. The division type shown in FIG. 3C is an example in which the in-plane of the wafer W is divided into three circular shapes (annular shapes) from the center for each predetermined diameter. In addition to the above example, it is possible to add and delete division types suitable for narrowing down other failure factors.
[0027]
In the data analysis processing to be described later, for each division type, it is determined which two-divided region and its combination can find a significant difference in the target data value compared to other regions and combinations. Yes.
[0028]
Next, means for digitizing the two-dimensional data distribution feature will be described. A specific description will be given by taking the division type shown in FIG. There are 30 combinations of the two division methods for the total of five areas (ZONE1 to ZONE5) in consideration of the size of the attribute value (= _Five C ₁ + _Five C ₂ + _Five C _Three + _Five C _Four ) Here, xi is a data value in each area, and all 5 areas are divided into arbitrary 2 areas. FIG. 4 is a diagram illustrating an example in which a plurality of regions are divided into two. In the illustrated example, one of the two divided areas (data value xh) is ZONE1,2,3, and the other b (data value xl) is ZONE4,5.
[0029]
here,
xi: Average value of xi in all regions (ZONE1 to 5)
x ₁ ~ _Three : Average value of xi in one region (xh: ZONE1, 2, 3)
x _Four ~ _Five : The average value of xi in the other region (xl: ZONE4, 5)
Then, the average value x of the divided areas xh and xl shown in FIG. ₁ ~ _Three And x _Four ~ _Five Are expressed by the following formulas (1) and (2), respectively, with a and b as variables.
[0030]
[Expression 1]

[0031]
It becomes. Here, (a−b) is obtained for all of the 30 region dividing methods for the a and b regions divided into two, and this can be quantitatively defined as a feature value for each dividing method. The two-dimensional data distribution feature amount on the surface of the wafer W can be defined by the value of b). Here, as the feature quantity, representative values (for example, average value, median value, minimum value, maximum value, standard deviation, etc.) of the areas a and b are used, and the difference between the areas a and b (a -B), ratio (a: b) can be used. In any case, a continuous value can be obtained as a value by an operation such as the above mathematical formula.
[0032]
The above description relates to the calculation of the two-dimensional data distribution feature amount for the division type shown in FIG. 3A, but the above equation (1) is similarly applied to FIGS. 3B and 3C. , (2). That is, even if the division type of the region is different, the value of the difference (ab) has the same meaning. Therefore, even if the value of the difference (ab) obtained by the division type of the different region is used. The amount can be treated equally. Therefore, for each wafer W, it is not determined for each area division type shown in FIGS. 3A to 3C, but comprehensively determining which division method of which area division type is most remarkable. Is possible.
[0033]
As for the difference (ab) between the areas a and b, the larger the absolute value | a−b | And the two-dimensional feature-value obtained for every division | segmentation type of each area | region shown to Fig.3 (a)-(c) is defined for every record (wafer W).
[0034]
FIG. 5 is a diagram illustrating the feature amounts obtained for each division type of each region. These feature quantities are treated as objective variables at the time of data analysis. As illustrated, in the n records 1 to N for each wafer W, n feature amounts 01 to n for each division type are defined as continuous values. The feature amount of each feature item is obtained in lot units. In the figure, feature quantities obtained by different calculation methods are stored for each of the division types shown in FIGS. For example, the feature 30 in the division type a (corresponding to FIG. 3 (a)) was obtained by a total of 30 two-division methods for the above-described five regions (ZONE1 to ZONE5) taking into account the magnitude of the attribute value. It is a two-dimensional feature quantity (continuous value).
[0035]
Next, regression tree analysis using the two-dimensional feature amount as an objective variable will be described. In the regression tree analysis, each feature quantity extracted with the same explanatory variable is selected as the target variable in order, and the regression tree analysis is automatically performed, thereby extracting the factors that influence each feature quantity. . In particular, if it is not clear what to analyze, all possible feature quantities are extracted, and regression tree analysis is performed using them as objective variables. As a result, various analysis results are obtained as described above, and an item that is regarded as having the largest significant difference among them is set as a candidate for a measure item for yield improvement. As described above, many significant differences that are not easily extracted by the conventional analysis method can be extracted efficiently.
[0036]
Here, the regression tree analysis and the statistical value list for evaluation will be described. First, the regression tree analysis will be briefly described. In the regression tree analysis, a set of records composed of explanatory variables indicating a plurality of attributes and objective variables affected thereby is determined, and attributes and attribute values that most affect the objective variables are discriminated. A rule indicating data characteristics and regularity is output from the means 23 (regression tree analysis engine) that performs data mining and extracts the rule file 24.
[0037]
The regression tree analysis process is realized by repeatedly dividing the set into two based on the parameter value (attribute value) of each explanatory variable (attribute). When the set is divided, if the sum of squares of the objective variables before the division is S0 and the sum of squares of the respective objective variables of the two sets after the division is S1 and S2, ΔS shown in the equation (3) becomes the maximum. As described above, the explanatory variable of the record to be divided and its parameter value are obtained.
[0038]
ΔS = S0− (S1 + S2) (3)
[0039]
The explanatory variables and parameter values obtained here correspond to branch points in the regression tree. Thereafter, the same processing is repeated for the divided sets, and the influence of the explanatory variable on the objective variable is examined. The above is a generally well-known method of regression tree analysis. In order to understand the clarity of set partitioning in more detail, in addition to ΔS, the following parameters (a) to ( d) is also used as a quantitative evaluation of the regression tree analysis results. These parameters are output as an evaluation statistical value list.
[0040]
(A) S ratio:
This is a reduction rate of the sum of squares by the set division, and is a parameter indicating how much the sum of squares has been reduced by the set division. The smaller this value is, the greater the effect of set partitioning is, and the set partition is clearly performed, so the significant difference is large.
[0041]
S ratio = ((S1 + S2) / 2) / S0 (4)
[0042]
(B) t value:
Although the set is divided into two by the regression tree analysis engine, it is a value for testing the difference between the averages (/ X1, / X2) of the two divided sets. Here, “/” indicates an overline. The statistical t-test is a standard indicating a significant difference in the mean value of the objective variable in the divided set. If the degree of freedom, that is, the number of data is the same, the larger the t, the more clearly the set is divided and the greater the difference.
[0043]
At this time, if there is no significant difference in the variance of the divided sets, the t value is obtained by the following equation (5). If there is a significant difference in the variance of the divided sets, the t value is obtained by the equation (6). Ask for. Here, N1 and N2 are the numbers of elements of set 1 and set 2, respectively. Further, / X1 and / X2 are averages of the respective sets after the division. S1 and S2 are the sum of squares of the objective variable of each set after division.
[0044]
[Expression 2]

[0045]
[Equation 3]

[0046]
(C) Difference in mean value of objective variables of divided sets:
The greater this value, the greater the difference.
[0047]
(D) Number of data in each divided set:
The smaller the difference between the two, the smaller the influence of abnormal values (noise).
[0048]
Then, the cause of the feature quantity shown in FIG. 5 is defined by defining electrical characteristic values (for example, current value, voltage value, resistance value) at the time of wafer inspection or the like as explanatory variables, and regression tree analysis is executed. FIG. 6 is a diagram illustrating an example of input data when the regression tree analysis is actually executed. The figure shows an example in which the data distribution feature amount extracted by the two-dimensional data distribution feature extraction means 21 is output in CSV format. Since each feature amount is obtained independently for each record, it is handled independently. For example, as shown in FIG. 6, since each feature amount is automatically output in the CSV format, significant analysis can be performed efficiently for each feature amount. The process of extracting the two-dimensional feature value can be incorporated in the program in advance, or a file defining the feature value to be extracted can be prepared and read and executed.
[0049]
Each feature amount is a two-dimensional distribution in the plane of the wafer W, and the feature value is not a discrete value but is defined by a continuous value indicating how strong the feature is. Since the values are continuous values, there is no loss of information due to discrete values as in the prior art, so a better analysis result is expected.
[0050]
Then, the regression tree analysis is sequentially performed on the assumption that there are a plurality of two-dimensional feature amounts 01 to n (see FIG. 5) as objective variables. Each of the n feature amounts is a two-dimensional feature amount obtained by a different division method (see, for example, FIG. 3). FIG. 7 is a diagram showing a regression tree analysis result. The figure shows an example of the result of regression tree analysis with the objective variable as the characteristic 02 shown in FIG. 6 and the explanatory variables as currents I1 to I9, voltages V1 to V5, and resistors R1 and R2. A regression tree diagram showing the results of the regression tree analysis is shown at the top of the diagram, and an evaluation statistical value list (Evaluation Data) representing the reliability information of the regression tree analysis is shown at the bottom of the diagram.
[0051]
Based on this regression tree analysis result shown in FIG. 7, it is found that whether or not the current I7 is 0.67 or less has the most influence on the value of the feature 02. In the statistical value list for evaluation shown in this figure, the order that most affects the target variable (feature 02) when the explanatory variable is changed is displayed in a graph in the top-bottom order.
[0052]
Since the item S ratio and t value in the second and lower rows of this statistical value list for evaluation are greatly different from those in the first row, the influence of the current I7 value may be conspicuous. I understand. In addition, for regression tree analysis results with other feature values as objective variables, the S ratio and t value of items in the first row of the statistical value list for evaluation are compared with those of items in the second and lower rows. There was no significant difference. There is not much difference between the first candidate and the second candidate. This indicates that it is difficult to take measures against defects because there are a plurality of factors of the same degree. As described above, even if a feature amount (objective variable) having a value larger than that of the feature 02 is generated, the feature 02 (whether the current I7 is 0.67 or less) having a clear cause of failure is taken as a countermeasure against the failure. It becomes easy to find out that the selection is appropriate.
[0053]
FIG. 8 is a flowchart showing an outline of the data analysis method according to the present invention. As described above, when this data analysis method is started, data to be analyzed, for example, data such as yield values and various measurement values of the wafer W is first selected and extracted from the original data group 42. (Step S1). Subsequently, the belonging area of the extracted individual data is determined (step S2). In the determination of the belonging area, two-dimensional coordinates of individual data on the wafer W are extracted, and it is determined to which area each data coordinate belongs to each area division pattern shown in FIG. Next, a process of extracting a two-dimensional data distribution feature amount in the plane of the wafer W is performed on the extracted data (step S3).
[0054]
Then, a data distribution feature amount to be analyzed is selected, and data mining such as regression tree analysis is performed using it as an objective variable (step S4). Regression tree analysis is performed on all the data distribution features extracted in step S3 (step S5: No to step S4 loop), and when the regression tree analysis is completed (step S5: Yes), the analysis result is output, The engineer performs the confirmation (step S6). Then, the engineer makes a decision based on the analysis result (step S7). Steps S6 and S7 may be configured to cause the computer to automatically execute the above-described evaluation function using the S ratio value, the t value, and the like using the evaluation statistical value list shown in FIG.
[0055]
According to the above-described embodiment, a plurality of two-dimensional data distribution feature quantities in the wafer surface existing in the original data group of the wafer W to be analyzed are defined and extracted, and these are quantified. Then, by selecting and analyzing each data distribution feature amount as an objective variable sequentially, it becomes possible to extract a statistically significant difference effective for data analysis such as defect analysis. This makes it possible to objectively, efficiently, and quantitatively extract the relevance and significant differences that were buried and difficult to discriminate even with a wide variety of data, regardless of the subjectivity of the engineer. become able to.
[0056]
Note that the extraction of the two-dimensional data distribution feature described in the above embodiment can be applied to one wafer W in the minimum unit, for example, when the wafer W to be analyzed is taken as an example. Can be used for each unit defined from the original data group. The present invention can also be easily applied to extraction of three-dimensional data distribution features.
[0057]
The data analysis method described above can be realized by executing a program prepared in advance on a computer such as a personal computer or a workstation. This program is recorded on various recording media and executed by being read from the recording media by a computer. The program may be a transmission medium that can be distributed via a network such as the Internet.
[0058]
(Appendix 1) A data extraction process for extracting data necessary for analysis of data to be analyzed from original data;
An attribution region determination step for determining which attribution region among a plurality of attribution regions set in advance on the two-dimensional coordinates to be analyzed corresponds to the data extracted by the data extraction step;
A two-dimensional data distribution feature amount extraction step for extracting a two-dimensional data distribution feature amount for each attribution region determined by the attribution region determination step using the data extracted by the data extraction step;
A data analysis step for analyzing data using the two-dimensional data distribution feature amount extracted by the two-dimensional data distribution amount extraction step;
A data analysis method comprising:
[0059]
(Supplementary Note 2) The two-dimensional data distribution feature extraction step includes:
The data according to appendix 1, wherein the two-dimensional data distribution feature amount is extracted by calculating the two-dimensional data distribution feature amount as a continuous value based on a calculation formula for determining the belonging region. analysis method.
[0060]
(Supplementary note 3) In

Supplementary note

1 or 2, wherein a record for each analysis target is set and filed, and a plurality of the two-dimensional data distribution feature quantities included in each record are set independently of each other The data analysis method described.
[0061]
(Appendix 4) The data analysis step
4. The rule according to any one of appendices 1 to 3, wherein a rule representing a feature or rule of two-dimensional data to be analyzed is extracted by data mining using the two-dimensional data distribution feature amount as an objective variable. Data analysis method.
[0062]
(Appendix 5) The data analysis step
The supplementary note 4, wherein the data mining extracts which of the extracted two-dimensional data distribution feature values is the objective variable that gives the most significant result to the data analysis. Data analysis method.
[0063]
(Additional remark 6) It includes the evaluation process which calculates | requires the set division | segmentation evaluation value showing the division | segmentation clarity at the time of dividing the data set obtained by the said data analysis process into 2 as information showing the reliability of the said rule, It is characterized by the above-mentioned. The data analysis method according to

appendix

4 or 5.
[0064]
(Supplementary note 7)
The data analysis method according to appendix 6, wherein an S ratio value represented by the following equation is used as the set partition evaluation value.
S ratio = ((S1 + S2) / 2) S0
However, S0 is the sum of squares of the objective variables before dividing the set of a plurality of original data into two, and S1 and S2 are the sum of squares of the objective variables of each set after the division.
[0065]
(Appendix 8) The evaluation process
The data analysis method according to appendix 6, wherein a value of t represented by the following expression is used as the set partition evaluation value.
[Expression 4]

[0066]
(Supplementary note 9)
7. The data analysis method according to appendix 6, wherein the number ratio of the extracted data belonging to each divided set after the division into two is used.
[0067]
(Supplementary Note 10) The evaluation step is as follows.
A statistical value list for evaluation is created in which an explanatory variable is attached in order from the smallest value of the S-ratio indicated by each divided set obtained by repeating the two divisions and the largest value of t. The data analysis method according to attachment 6.
[0068]
(Supplementary note 11) A data analysis program for performing data analysis of original data including an objective variable indicating a variation in quality of an analysis target and an explanatory variable explaining the variation of the objective variable,
A data extraction step for extracting data necessary for data analysis of the analysis target from the original data;
An attribution region determination step for determining which attribution region of a plurality of attribution regions set in advance on the two-dimensional coordinates of the analysis target corresponds to the data extracted by the data extraction step;
A two-dimensional data distribution feature amount extraction step for extracting a two-dimensional data distribution feature amount for each determined attribution region using the data extracted in the data extraction step;
A data analysis program for performing data analysis using the two-dimensional data distribution feature amount extracted by the two-dimensional data distribution feature amount extraction step.
[0069]
(Supplementary Note 12) Data extraction means for extracting data necessary for analysis of data to be analyzed from original data;
Belonging area determination means for determining which belonging area among a plurality of belonging areas set in advance on the two-dimensional coordinates to be analyzed corresponds to the data extracted by the data extracting means;
Two-dimensional data distribution feature quantity extraction means for extracting a two-dimensional data distribution feature quantity for each belonging area determined by the belonging area determination means using the data extracted by the data extraction means;
Data analysis means for analyzing data using the extracted two-dimensional data distribution feature quantity;
A data analysis apparatus comprising:
[0070]
(Supplementary note 13) Clarification of division when the data set obtained by the data analysis means is divided into two as information representing the reliability of the rule which is the feature or rule of the two-dimensional data analyzed by the data analysis means 13. The data analysis apparatus according to appendix 12, further comprising an evaluation unit that obtains a set division evaluation value representing a degree.
[0071]
(Additional remark 14) The said analysis object is each chip | tip arrange | positioned in multiple numbers on the surface of a semiconductor wafer, The characteristic value of each said chip | tip is output as said original data,
The belonging area determining means divides the wafer surface into a plurality of areas, and includes the divided areas in any two areas to determine the belonging area. The data analysis apparatus according to appendix 13.
[0072]
(Supplementary Note 15) The two-dimensional data distribution feature amount extraction means includes:
Item 15. The supplementary note 14, wherein a representative value of each data belonging to each of the two arbitrary regions is a feature amount, and a difference or ratio between the total values of the two regions is a two-dimensional data distribution feature amount. Data analysis device.
[0073]
【The invention's effect】
According to the present invention, the two-dimensional coordinates of individual data are extracted to determine which divided area the individual data of the two-dimensional feature extraction target to be analyzed belongs, and for each area division pattern. Since it can be determined as to which region it belongs and can be obtained as a continuous value, the ratio depending on the experience and know-how of engineers can be reduced and two-dimensional feature values can be easily defined and extracted, and the extracted feature values are used. Thus, the data distribution feature on the two-dimensional coordinates can be easily extracted, and the data analysis can be performed efficiently.
[Brief description of the drawings]
FIG. 1 is a diagram showing an example of a hardware configuration of a computer system that is provided for carrying out a data analysis method according to an embodiment of the present invention.
FIG. 2 is a block diagram showing an example of a functional configuration of a data analysis apparatus realized by the computer system having the configuration shown in FIG.
FIG. 3 is a diagram illustrating a definition of distribution features extracted by a two-dimensional data distribution feature extraction unit;
FIG. 4 is a diagram illustrating an example in which a plurality of areas are divided into two.
FIG. 5 is a diagram showing feature amounts obtained for each division type of each region;
FIG. 6 is a diagram showing an example of input data when actually executing a regression tree analysis.
FIG. 7 is a diagram illustrating a regression tree analysis result.
FIG. 8 is a flowchart showing an outline of a data analysis method according to the present invention.
FIG. 9 is a diagram showing a display screen used in conventional data analysis.
FIG. 10 is a view showing a distribution in a wafer plane of chips that have passed the characteristic test.
[Explanation of symbols]
1 Input device
2 Central processing unit
3 Output device
4 storage devices
21 Data distribution feature extraction means
22 Means for selecting analysis target feature
23 Data mining means
24 Feature and regularity rule files
25 Statistical analysis components
26 Charting components
27 Analysis tools
41 database
42 Original data group

Claims

A data extraction process for extracting chip yield data on the wafer necessary for analyzing the data to be analyzed from the original data;
An attribution region determination step of determining which attribution region among a plurality of attribution regions set in advance on the two-dimensional coordinates to be analyzed corresponds to the chip yield data on the wafer extracted by the data extraction step;
A two-dimensional data distribution feature amount extraction step for extracting a two-dimensional data distribution feature amount for each belonging region determined by the belonging region determination step using chip yield data on the wafer extracted by the data extraction step;
A data analysis step of analyzing data using the two-dimensional data distribution feature amount extracted by the two-dimensional data distribution feature amount extraction step,
The two-dimensional data distribution feature extraction step includes:
By calculating the two-dimensional data distribution feature amount as a continuous value based on a predetermined calculation formula using a representative value of the divided two regions, the plurality of belonging regions are divided into two regions. A method for analyzing chip yield data on a wafer , characterized by extracting a dimensional data distribution feature.

The record for each analysis target is set and filed, and a plurality of the two-dimensional data distribution feature quantities included in each record are set independently of each other . Chip yield data analysis method.

The data analysis step includes
3. A chip on a wafer according to claim 1, wherein a rule representing a feature or rule of two-dimensional data to be analyzed is extracted by data mining using the two-dimensional data distribution feature amount as an objective variable. Yield data analysis method.

The information representing the reliability of the rule includes an evaluation step for obtaining a set division evaluation value representing the degree of clarity of division when the data set obtained by the data analysis step is divided into two. The method for analyzing the yield of chips on the wafer as described in 1 above .