CN112613264A - Distributed extensible small chip design framework - Google Patents
- Publication number
- CN112613264A
- Authority
- CN
- China
- Prior art keywords
- distributed
- scalable
- chip
- architecture
- chiplet
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F30/00—Computer-aided design [CAD]
- G06F30/30—Circuit design
- G06F30/38—Circuit design at the mixed level of analogue and digital signals
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2115/00—Details relating to the type of the circuit
- G06F2115/02—System on chip [SoC] design
Landscapes
- Engineering & Computer Science (AREA)
- Computer Hardware Design (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Evolutionary Computation (AREA)
- Geometry (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Design And Manufacture Of Integrated Circuits (AREA)
- Semiconductor Integrated Circuits (AREA)
Abstract
The invention discloses a distributed, extensible small chip (chiplet) design framework in which identical or different functional-module architectures are designed on separate dies to realize distributed, extensible computing. Each functional-module architecture is implemented concretely as a chiplet, and the chiplets are interconnected and work cooperatively over high-speed inter-chip links. The invention removes the limit that an SoC places on single-die area, together with the accompanying limits on performance and computing power. It improves chiplet production yield, reduces total chip cost, and provides flexible scalability in application configuration and performance.
Description
Technical Field
The invention discloses a distributed extensible small chip design framework and relates to the technical field of chip design.
Background
Chip designs on the market today mostly place only a single die in a single package. Take NVIDIA's previous-generation architecture, Pascal, and its current latest architecture, Turing: the transistor count rose from 12 billion to as many as 18.6 billion, an increase of 55%, while the die area grew from 471 mm² to 754 mm², an increase of 60%. The added computing power therefore did not come from advanced process scaling. This means that some parts of a design cannot benefit from process scaling, yet an expensive process is still used for them. Moreover, because a single die is so large, even one atomic-scale defect or a trace impurity introduced during wafer fabrication affects product yield. To avoid scrapping the whole die, backup (redundant) designs and repair circuits must be added to it, which significantly reduces the die's effective utilization.
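The figures quoted above can be checked with a few lines of arithmetic. The sketch below is an illustration added here, not part of the original disclosure; the variable names and the density metric are assumptions. It recomputes the two percentage increases and compares transistors per unit area across the two generations.

```python
# Figures quoted in the background paragraph (previous vs. current generation).
prev_gen = {"transistors": 12.0e9, "area_mm2": 471.0}
curr_gen = {"transistors": 18.6e9, "area_mm2": 754.0}


def percent_increase(old: float, new: float) -> float:
    """Relative growth from old to new, in percent."""
    return (new - old) / old * 100.0


def density_mtr_per_mm2(chip: dict) -> float:
    """Transistor density in millions of transistors per mm^2."""
    return chip["transistors"] / chip["area_mm2"] / 1e6


transistor_growth = percent_increase(prev_gen["transistors"], curr_gen["transistors"])
area_growth = percent_increase(prev_gen["area_mm2"], curr_gen["area_mm2"])

print(f"transistor count: +{transistor_growth:.0f}%")
print(f"die area:         +{area_growth:.0f}%")
print(f"density: {density_mtr_per_mm2(prev_gen):.1f} -> "
      f"{density_mtr_per_mm2(curr_gen):.1f} MTr/mm^2")
```

With the quoted numbers, density comes out at roughly 25 million transistors per mm² for both generations, which is consistent with the statement that the gain came from a larger die rather than from process scaling.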
A conventional SoC architecture is shown in FIG. 1. It comprises several functional modules and functional blocks connected by a system bus. A functional module can be a logic module, an analog module, or a memory module. A functional block is an aggregation of identical functional modules that together provide greater capability. The system bus provides high-bandwidth interconnection between modules and serves as a high-speed direct path for data transfer.
To make effective use of advanced process technology, a single chip package carries multiple small chips (also called chiplets). Each chiplet can then be kept small enough to achieve good yield, which reduces the design complexity and silicon-area cost of backup designs and repair circuits. In addition, designs such as analog circuits that gain little from a scaled process, such as 12 nm or 7 nm, can be concentrated on chiplets built in a mainstream process, such as 28 nm or 22 nm, improving the chip's cost-effectiveness. Placing interface functions on a chiplet also makes the chip more flexible. Furthermore, performance scalability can be achieved by packaging different numbers of chiplets for different target markets.
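The yield argument can be made concrete with a simple model. The sketch below uses the Poisson yield model, a common textbook simplification that the source does not mention. The defect density `D0`, the four-way split, and the assumption that defective chiplets are discarded before packaging are all hypothetical, and interconnect area overhead and packaging yield are ignored.

```python
import math


def poisson_yield(area_mm2: float, defects_per_cm2: float) -> float:
    """Fraction of dies with zero killer defects under a Poisson model."""
    area_cm2 = area_mm2 / 100.0
    return math.exp(-area_cm2 * defects_per_cm2)


D0 = 0.1                 # hypothetical defect density, defects per cm^2
monolithic_area = 754.0  # mm^2, the large-die figure from the background
n_chiplets = 4           # hypothetical split
chiplet_area = monolithic_area / n_chiplets  # ignores link-area overhead

y_mono = poisson_yield(monolithic_area, D0)
y_chiplet = poisson_yield(chiplet_area, D0)

# Silicon consumed per good product. For chiplets, bad dies are assumed to be
# screened out individually before packaging, so only 1/y_chiplet dies are
# fabricated per good chiplet.
silicon_per_good_mono = monolithic_area / y_mono
silicon_per_good_chiplets = n_chiplets * chiplet_area / y_chiplet

print(f"monolithic yield: {y_mono:.1%}, chiplet yield: {y_chiplet:.1%}")
print(f"silicon per good product: {silicon_per_good_mono:.0f} mm^2 vs "
      f"{silicon_per_good_chiplets:.0f} mm^2")
```

Under these assumptions the smaller dies yield noticeably better, and less silicon is spent per good product. The real benefit depends on the actual defect density and on the area added by the inter-chip links.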
Disclosure of Invention
The technical problem to be solved by the invention is as follows: in view of the shortcomings of the prior art, a distributed, extensible small chip (chiplet) design architecture is provided that breaks through the design limits of the SoC. The chiplet architecture provides a scalable computing scheme, achieves flexible configurability and performance scalability according to application requirements, and offers flexible, controllable distributed computing capability through distributed computing units and their accompanying analog circuits and memories.
The invention adopts the following technical scheme for solving the technical problems:
A distributed, extensible small chip (chiplet) design architecture, in which identical or different functional-module architectures are designed on separate dies to realize distributed, extensible computing; each functional-module architecture is implemented concretely as a chiplet; and the chiplets are interconnected and work cooperatively over high-speed inter-chip links.
In a further preferred scheme, "distributed" in the distributed, extensible computing means specifically that one computing task is executed cooperatively by two or more separate chiplets. "Extensible" means specifically that the number of chiplets can be increased by design according to the requirements of the application scenario, without being constrained by the die size and manufacturability of an SoC.
In a further preferred scheme, the functional modules include logic circuits, analog circuits, or memory circuits. The number of separate dies is two or more.
In a further preferred scheme, the high-speed inter-chip link must have bandwidth above a set threshold, latency below a set threshold, and power consumption below a set threshold. The link may be parallel or serial.
In a further preferred scheme, one or more groups of high-speed inter-chip links are arranged between two chiplets.
In a further preferred scheme, the thresholds for bandwidth, latency, and power consumption are chosen by comparison with an integrated implementation: if the separate chiplets were instead implemented as a single integrated chip, the transfers among their corresponding functional modules would run over an ordinary on-chip bus, and the performance of that bus connection serves as the reference for the thresholds.
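The three-threshold requirement can be sketched as a simple acceptance check. This is only an illustration: the class name, field names, units, and every number below are hypothetical, and the source does not specify how the baseline is measured.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LinkMetrics:
    bandwidth_gbps: float  # higher is better
    latency_ns: float      # lower is better
    power_mw: float        # lower is better


def meets_thresholds(link: LinkMetrics, thresholds: LinkMetrics) -> bool:
    """True if bandwidth is above, and latency and power are below, the thresholds."""
    return (link.bandwidth_gbps > thresholds.bandwidth_gbps
            and link.latency_ns < thresholds.latency_ns
            and link.power_mw < thresholds.power_mw)


# Hypothetical thresholds, taken from what an on-chip bus would deliver
# if the same modules sat on one integrated die.
soc_bus_baseline = LinkMetrics(bandwidth_gbps=256.0, latency_ns=20.0, power_mw=500.0)

candidate_a = LinkMetrics(bandwidth_gbps=320.0, latency_ns=15.0, power_mw=400.0)
candidate_b = LinkMetrics(bandwidth_gbps=320.0, latency_ns=35.0, power_mw=400.0)

print(meets_thresholds(candidate_a, soc_bus_baseline))  # all three criteria met
print(meets_thresholds(candidate_b, soc_bus_baseline))  # latency too high
```

All three conditions must hold together, so a link with ample bandwidth still fails if its latency exceeds the baseline.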
Compared with the prior art, the invention, by adopting the above technical scheme, has the following technical effects: it removes the limit that an SoC places on single-die area, together with the accompanying limits on performance and computing power; it improves chiplet production yield, reduces total chip cost, and provides flexible scalability in application configuration and performance.
Drawings
FIG. 1 is a schematic diagram of a conventional SoC architecture.
FIG. 2 is a schematic diagram of the present invention, in which a chiplet architecture replaces the SoC architecture.
FIG. 3 is a structural diagram of function splitting implemented with the chiplet architecture in an embodiment of the present invention.
FIG. 4 is a structural diagram of function replication implemented with the chiplet architecture in an embodiment of the present invention.
Detailed Description
Reference will now be made in detail to embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like reference numerals refer to the same or similar elements or elements having the same or similar function throughout. The embodiments described below with reference to the accompanying drawings are illustrative only for the purpose of explaining the present invention, and are not to be construed as limiting the present invention.
The technical scheme of the invention is explained in further detail below with reference to the accompanying drawings:
FIG. 2 shows schematically how the chiplet architecture of the present invention replaces an SoC architecture. When the chiplets are designed, the design can be divided into two or more chiplets according to the partitioning of functional modules and the product plan. The chiplets are connected to one another through high-speed links, which provide high-speed interconnection and communication among them. The design framework supports dividing tasks among the chiplets and having them cooperate. Because the computation is distributed, the number of chiplets and their topology can be adjusted to application requirements, so that the computing power of the solution scales flexibly.
In this distributed, extensible small chip (chiplet) design architecture, identical or different functional-module architectures are designed on separate dies to realize distributed, extensible computing; each functional-module architecture is implemented concretely as a chiplet; and the chiplets are interconnected and work cooperatively over high-speed inter-chip links. "Distributed" means specifically that one computing task is executed cooperatively by two or more separate chiplets. "Extensible" means specifically that the number of chiplets can be increased by design according to the requirements of the application scenario, without being constrained by the die size and manufacturability of an SoC.
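The idea of one computing task shared by several chiplets, with the chiplet count as a free parameter, can be sketched in software. The model below is a toy written for illustration. The function names and the sum-of-squares workload are hypothetical, and a real design would dispatch work over the inter-chip links rather than in a Python loop.

```python
from typing import List, Sequence


def partition(work: Sequence[int], n_chiplets: int) -> List[List[int]]:
    """Split work items into n contiguous, near-equal shares."""
    if n_chiplets < 1:
        raise ValueError("need at least one chiplet")
    base, extra = divmod(len(work), n_chiplets)
    shares, start = [], 0
    for i in range(n_chiplets):
        size = base + (1 if i < extra else 0)
        shares.append(list(work[start:start + size]))
        start += size
    return shares


def chiplet_compute(share: Sequence[int]) -> int:
    """Stand-in for the work one chiplet performs on its share."""
    return sum(x * x for x in share)


def run_distributed(work: Sequence[int], n_chiplets: int) -> int:
    """Each chiplet handles one share; the partial results are then combined."""
    partials = [chiplet_compute(share) for share in partition(work, n_chiplets)]
    return sum(partials)


task = list(range(10))
for n in (1, 2, 4):
    print(n, "chiplet(s):", run_distributed(task, n))
```

The result is the same for any chiplet count. Only the share handled by each chiplet changes, which is what allows the number of chiplets to be chosen per application.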
In an embodiment of the present invention, FIG. 3 shows function splitting implemented with the chiplet architecture: the functions of an SoC are split across two or more chiplets. The greatest advantage is that several chiplets with different functions can be developed, each in a suitable process, with better production yield.
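Function splitting by process suitability can be sketched as a grouping step. The mapping below follows the background's examples of analog circuits on a mainstream node and logic on an advanced node. The memory assignment, the module names, and the one-chiplet-per-node rule are hypothetical simplifications.

```python
from collections import defaultdict
from typing import Dict, List

# Assumed policy: logic on an advanced node, analog on a mainstream node.
# The memory entry is a hypothetical choice, not taken from the source.
PROCESS_BY_KIND: Dict[str, str] = {
    "logic": "7nm",
    "analog": "28nm",
    "memory": "7nm",
}


def split_into_chiplets(modules: Dict[str, str]) -> Dict[str, List[str]]:
    """Group SoC modules into one chiplet per target process node."""
    chiplets: Dict[str, List[str]] = defaultdict(list)
    for name, kind in modules.items():
        chiplets[PROCESS_BY_KIND[kind]].append(name)
    return dict(chiplets)


# Hypothetical module list for a monolithic SoC.
soc_modules = {
    "cpu_cluster": "logic",
    "npu": "logic",
    "serdes_phy": "analog",
    "pll": "analog",
    "sram": "memory",
}

for node, names in split_into_chiplets(soc_modules).items():
    print(f"{node} chiplet: {names}")
```

A real partitioning would also weigh traffic between modules, since every boundary crossed becomes an inter-chip link.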
In a specific embodiment of the present invention, FIG. 4 shows function replication implemented with the chiplet architecture: a key function of an SoC is implemented repeatedly on two or more chiplets. Besides the benefit of good yield, the number of chiplets can be scaled flexibly to meet different application requirements.
In the design scheme of the invention, functions can be divided, according to the chiplet's function plan, into basic required functions and flexibly configurable functions. The basic required functions can be the same in every chiplet, while each configurable function can be omitted, included once, or included multiple times as required.
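One way to picture the split between basic and configurable functions is a small configuration record. The sketch below is illustrative only: the class, the function names, and the choice of which functions count as basic are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict

# Hypothetical set of functions that every chiplet carries.
BASE_FUNCTIONS = frozenset({"link_controller", "power_management"})


@dataclass
class ChipletConfig:
    name: str
    # Configurable function -> instance count (0 = omitted, 1 = single, >1 = multiple).
    optional: Dict[str, int] = field(default_factory=dict)

    def __post_init__(self) -> None:
        for fn, count in self.optional.items():
            if count < 0:
                raise ValueError(f"negative instance count for {fn!r}")

    def functions(self) -> Dict[str, int]:
        """All functions present on this chiplet, with instance counts."""
        present = {fn: 1 for fn in BASE_FUNCTIONS}
        present.update({fn: n for fn, n in self.optional.items() if n > 0})
        return present


minimal = ChipletConfig("minimal")
io_chiplet = ChipletConfig("io", {"pcie": 1})
compute = ChipletConfig("compute", {"npu_core": 4, "pcie": 0})

for cfg in (minimal, io_chiplet, compute):
    print(cfg.name, sorted(cfg.functions().items()))
```

Every configuration reports the same basic functions, and a configurable function with a count of zero simply does not appear.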
A high-speed inter-chip interconnect must be added to each chiplet to serve as the high-speed link for communication between chiplets. Depending on the number of interconnected chiplets and the interconnect bandwidth required, one or more groups of high-speed inter-chip links may be provided. The key design point is that interconnect performance between chiplets should be on the same level as that of an SoC's internal bus: the high-speed inter-chip link must have bandwidth above a set threshold, latency below a set threshold, and power consumption below a set threshold.
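Choosing between one and several link groups comes down to a ceiling division of required bandwidth by per-group bandwidth. The helper below is a hypothetical sketch. The bandwidth figures are invented, and it assumes groups aggregate linearly, which a real link may not do.

```python
import math


def link_groups_needed(required_gbps: float, per_group_gbps: float) -> int:
    """Smallest number of link groups whose combined bandwidth covers the requirement."""
    if required_gbps <= 0 or per_group_gbps <= 0:
        raise ValueError("bandwidths must be positive")
    return math.ceil(required_gbps / per_group_gbps)


# Hypothetical: each link group delivers 128 Gb/s.
for required in (100.0, 256.0, 300.0):
    groups = link_groups_needed(required, 128.0)
    print(f"{required:.0f} Gb/s required: {groups} group(s)")
```

Adding groups raises bandwidth but also raises power and pin count, so the count has to be weighed against the power threshold as well.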
Chiplets with different functions can be reused in different product combinations; the key design point is that their high-speed inter-chip interconnect designs must be compatible.
The embodiments of the present invention have been described in detail with reference to the drawings, but the present invention is not limited to the above embodiments, and various changes can be made within the knowledge of those skilled in the art without departing from the gist of the present invention. Although the present invention has been described with reference to a preferred embodiment, it should be understood that various changes, substitutions and alterations can be made herein without departing from the spirit and scope of the invention as defined by the appended claims.
Claims (9)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011573655.XA CN112613264A (en) | 2020-12-25 | 2020-12-25 | Distributed extensible small chip design framework |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011573655.XA CN112613264A (en) | 2020-12-25 | 2020-12-25 | Distributed extensible small chip design framework |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112613264A (en) | 2021-04-06 |
Family
ID=75248101
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011573655.XA Pending CN112613264A (en) | 2020-12-25 | 2020-12-25 | Distributed extensible small chip design framework |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112613264A (en) |
Citations (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020046324A1 (en) * | 2000-06-10 | 2002-04-18 | Barroso Luiz Andre | Scalable architecture based on single-chip multiprocessing |
US20110235652A1 (en) * | 2010-03-25 | 2011-09-29 | International Business Machines Corporation | Implementing enhanced link bandwidth in a headless interconnect chip |
CN102246304A (en) * | 2008-11-04 | 2011-11-16 | 全球Oled科技有限责任公司 | Device with chiplets and adaptable interconnections |
CN102799509A (en) * | 2012-07-10 | 2012-11-28 | 中国科学技术大学 | High-bandwidth extendable complex logic verification system based on double FPGA (Field- Programmable Gate Array) chips |
CN103348402A (en) * | 2011-02-10 | 2013-10-09 | 全球Oled科技有限责任公司 | Chiplet display device with serial control |
US20140044157A1 (en) * | 2011-02-14 | 2014-02-13 | The Regents Of The University Of California | Multi-band interconnect for inter-chip and intra-chip communications |
US20150261709A1 (en) * | 2014-03-14 | 2015-09-17 | Emilio Billi | Peripheral component interconnect express (pcie) distributed non- transparent bridging designed for scalability,networking and io sharing enabling the creation of complex architectures. |
CN109542817A (en) * | 2018-11-09 | 2019-03-29 | 中国船舶重工集团公司第七二三研究所 | A kind of general ECM set control framework |
US20200136906A1 (en) * | 2019-04-30 | 2020-04-30 | Francesc Guim Bernat | Modular i/o configurations for edge computing using disaggregated chiplets |
CN111488308A (en) * | 2020-04-17 | 2020-08-04 | 苏州浪潮智能科技有限公司 | System and method for supporting multi-processor expansion of different architectures |
US20200294180A1 (en) * | 2019-03-15 | 2020-09-17 | Intel Corporation | Enabling product skus based on chiplet configurations |
CN111737184A (en) * | 2020-05-29 | 2020-10-02 | 苏州浪潮智能科技有限公司 | An AI server computing unit architecture and implementation method |
CN111741601A (en) * | 2020-07-09 | 2020-10-02 | 复旦大学 | A Universal Configurable Active Substrate Circuit Structure |
Non-Patent Citations (3)
Title |
---|
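安长俊;周家婕;魏斌;高明: "Design and Implementation of a High-Performance Embedded Parallel Processing System Based on Distributed Storage", 电子世界, no. 18, pages 131 - 133 * |
董海涛;陈光胜;王晓辉;吴强: "Research on SoC Chip Design Technology Based on BT5.0", 集成电路应用, no. 05, pages 5 - 7 * |
雷航,王茜 (eds.): "Modern Microprocessor and Bus Design" (《现代微处理器及总线设计》), vol. 1, 国防工业出版社, pages 352 - 354 * |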
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115101106A (en) * | 2022-06-01 | 2022-09-23 | 南京蓝洋智能科技有限公司 | A method for expanding the capacity of SoC SRAM |
CN115617739A (en) * | 2022-09-27 | 2023-01-17 | 南京信息工程大学 | Chiplet architecture-based chip and control method |
CN115617739B (en) * | 2022-09-27 | 2024-02-23 | 南京信息工程大学 | Chip based on Chiplet architecture and control method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112612748A (en) | Super heterogeneous computing method based on extensible small chip architecture | |
CN112613264A (en) | Distributed extensible small chip design framework | |
US11837503B2 (en) | Scalable and flexible architectures for integrated circuit (IC) design and fabrication | |
US10591544B2 (en) | Programmable integrated circuits with in-operation reconfiguration capability | |
US6266797B1 (en) | Data transfer network on a computer chip using a re-configurable path multiple ring topology | |
US11239203B2 (en) | Multi-chip stacked devices | |
US8189603B2 (en) | PCI express to PCI express based low latency interconnect scheme for clustering systems | |
CN111786894B (en) | FPGA device for realizing on-chip network transmission bandwidth expansion function | |
US10886921B1 (en) | Multi-chip stacked devices | |
US11127718B2 (en) | Multi-chip stacked devices | |
US12027512B2 (en) | Chipset and manufacturing method thereof | |
CN110609805B (en) | Method for realizing system-on-chip | |
US20180024618A1 (en) | Power Control in Integrated Circuits | |
US20050015572A1 (en) | Semiconductor integrated circuit | |
CN110192188B (en) | Self-Healing Computing Array | |
CN113656345B (en) | Computing device, computing system and computing method | |
CN112699077B (en) | FPGA chip and interconnection method of FPGA sub-chips | |
CN112631989A (en) | Data transmission method among small chips, among chips and among small chips | |
US10990307B2 (en) | System and method for providing a configurable storage media interface | |
US12190038B1 (en) | Multi-chip module (MCM) with multi-port unified memory | |
Srinivasa et al. | A 300MB SRAM, 20Tb/s Bandwidth Scalable Heterogenous 2.5 D System Inferencing Simultaneous Streams Across 20 Chiplets with Workload-Dependent Configurations | |
WO2021055038A1 (en) | Redundancy scheme for multi-chip stacked devices | |
CN112732631A (en) | Data transmission method between small chips | |
WO2009155762A1 (en) | Array processor structure | |
TWI854950B (en) | Input/output die and system-on-a-chip |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | Application publication date: 20210406 |