[go: up one dir, main page]

CN112613264A - Distributed extensible small chip design framework - Google Patents

Distributed extensible small chip design framework Download PDF

Info

Publication number
CN112613264A
CN112613264A CN202011573655.XA CN202011573655A CN112613264A CN 112613264 A CN112613264 A CN 112613264A CN 202011573655 A CN202011573655 A CN 202011573655A CN 112613264 A CN112613264 A CN 112613264A
Authority
CN
China
Prior art keywords
distributed
scalable
chip
architecture
chiplet
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011573655.XA
Other languages
Chinese (zh)
Inventor
蔡宗宇
陈希恒
韦红芳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Lanyang Intelligent Technology Co ltd
Original Assignee
Nanjing Lanyang Intelligent Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing Lanyang Intelligent Technology Co ltd filed Critical Nanjing Lanyang Intelligent Technology Co ltd
Priority to CN202011573655.XA priority Critical patent/CN112613264A/en
Publication of CN112613264A publication Critical patent/CN112613264A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/30Circuit design
    • G06F30/38Circuit design at the mixed level of analogue and digital signals
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2115/00Details relating to the type of the circuit
    • G06F2115/02System on chip [SoC] design

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Hardware Design (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Geometry (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Design And Manufacture Of Integrated Circuits (AREA)
  • Semiconductor Integrated Circuits (AREA)

Abstract

The invention discloses a distributed extensible small chip design framework, which designs the same or different functional module frameworks on separate wafers to realize distributed extensible computation; the functional module architecture realizes specific design through the architecture of a small chip; and the small chips realize interconnection and cooperative work through a high-speed chip interconnection link. The invention breaks the limitation of the SoC on the area of a single chip and the accompanying performance and computational power limitation. The high production yield of the small chips is improved, the total chip cost is reduced, and the flexible expandability in application configuration and performance is achieved.

Description

Distributed extensible small chip design framework
Technical Field
The invention discloses a distributed extensible small chip design framework and relates to the technical field of chip design.
Background
The chip design technology adopted in the market is mainly that only a single die (die) is provided in a single package, such as NVIDIA previous generation architecture Pascal and current latest architecture graphics (ringing), and the number of transistors (Transistor Count) is increased from 12 billion to as much as 18.6 billion, which is increased by 55%. The wafer area is increased from 471mm 2 to 754mm 2 by 60%, which is not the result of the computational advanced process scaling. This means that some designs cannot benefit from process scaling, but rather expensive processes are used for these designs. On the other hand, since the area of a single wafer is so large, the yield of the product is affected only by an atomic defect or a silk impurity in the wafer manufacturing process. In order to avoid the whole die from being scrapped, a backup design and repair circuit must be added to the die, which will significantly reduce the effective utilization rate of the die.
A conventional SoC architecture is shown in fig. 1, which includes several functional modules and functional blobs connected by a system bus. The functional module can be a logic module, an analog module or a memory module. The functional blocks are the aggregation of the same functional modules, and can cumulatively provide stronger functions. The system bus provides high bandwidth inter-module interconnections as a high speed direct path for data transfers.
In order to effectively use the advantages of advanced process technology, a single chip (chip) package is used to carry a plurality of small chips (also called chiplets), so that each small chip can be controlled to a good yield, and the design complexity and corresponding silicon area cost of the backup design and repair circuit are simplified. On the other hand, for designs such as analog circuits that cannot be advantageously implemented in a micro tape manufacturing process, such as a 12 nm or 7 nm process, the designs are concentrated on the chips of the mainstream manufacturing process, such as a 28 nm or 22 nm process, so as to improve the cost performance of the chips. And the flexibility of the chip is also improved by putting the interface function on the small chip. Furthermore, scalability in performance can also be achieved by packaging different numbers of chiplets for different target markets.
Disclosure of Invention
The technical problem to be solved by the invention is as follows: aiming at the defects of the prior art, a distributed extensible small chip design architecture is provided, the design limitation of SoC is broken through, an extensible computing scheme is provided by a small chip (chipset) architecture, flexible configurability and extensibility (scalability) in performance are achieved according to application requirements, and flexible controllable distributed computing capability is provided by distributed computing units and matched analog circuits/memories.
The invention adopts the following technical scheme for solving the technical problems:
a distributed extensible small chip design architecture is characterized in that the same or different functional module architectures are designed on separate wafers to realize distributed extensible computation; the functional module architecture realizes specific design through the architecture of a small chip; and the small chips realize interconnection and cooperative work through a high-speed chip interconnection link.
For a further preferred solution, in the distributed scalable computation, the distribution is specifically that a share of computation task is performed by two or more separate chiplets in a coordinated manner. The method can be expanded, particularly, the number of small chips is increased by design according to the application scene requirements, and the method is not limited by the chip size and production of the SoC.
For a further preferable scheme, the functional module includes a logic circuit, an analog circuit, or a memory circuit. The number of the divided cells is two or more.
For a further preferable scheme, the high-speed chip interconnection link requires that the bandwidth is greater than a set threshold, the time delay is less than the set threshold, and the power consumption is less than the set threshold. The high-speed chip interconnection link is a parallel link or a serial link.
As a further preferred scheme, one or more groups of high-speed chip interconnection links are arranged between the two small chips.
For a further preferred scheme, the selection of the parameter setting threshold values of the bandwidth, the time delay and the power consumption is obtained by comparing the mutual transmission state among the functional modules corresponding to the chiplets with the general bus connection realization state in the integrated chip when the plurality of separated chiplets are realized in the manner of the integrated chip.
Compared with the prior art, the invention adopting the technical scheme has the following technical effects: the invention breaks the limitation of the SoC on the area of a single chip and the accompanying performance and computational power limitation. The high production yield of the small chips is improved, the total chip cost is reduced, and the flexible expandability in application configuration and performance is achieved.
Drawings
Fig. 1 is a schematic diagram of a conventional SoC architecture.
FIG. 2 is a schematic diagram of the present invention, in which a small chip architecture is used to replace the SoC architecture.
FIG. 3 is a diagram illustrating a functional split implemented by a chiplet architecture in an embodiment of the present invention.
FIG. 4 is a block diagram illustrating a functional replication implemented by a chiplet architecture in an embodiment of the present invention.
Detailed Description
Reference will now be made in detail to embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like reference numerals refer to the same or similar elements or elements having the same or similar function throughout. The embodiments described below with reference to the accompanying drawings are illustrative only for the purpose of explaining the present invention, and are not to be construed as limiting the present invention.
The technical scheme of the invention is further explained in detail by combining the attached drawings:
in the present invention, a schematic diagram of replacing an SoC architecture with a chiplet architecture is shown in fig. 2, and when implementing design of a chiplet, the design can be implemented by dividing into two or more chiplets according to the division of functional modules and the planning of products. The small chips are connected in a butt joint mode through high-speed links, and high-speed interconnection and intercommunication among the small chips are achieved. The design framework can support task division and cooperation among the small chips, and the arrangement of the number of the small chips and the topological structure design can be adjusted according to application requirements by means of the characteristic of distributed computing, so that the calculation power of the scheme is flexibly expanded.
The distributed extensible small chip design architecture is characterized in that the same or different functional module architectures are designed on separate wafers to realize distributed extensible calculation; the functional module architecture realizes specific design through the architecture of a small chip; and the small chips realize interconnection and cooperative work through a high-speed chip interconnection link. In the distributed extensible computing, the distribution is specifically that one computing task is cooperatively executed by two or more separate chiplets. The method can be expanded, particularly, the number of small chips is increased by design according to the application scene requirements, and the method is not limited by the chip size and production of the SoC.
In an embodiment of the present invention, a structural diagram of a functional split implemented by a chiplet architecture is shown in fig. 3, which splits a function of an SoC into two or more chiplets, and has the greatest advantages of being able to develop several chiplets with different functions, adopting a suitable process, and obtaining a better production yield.
In a specific embodiment of the present invention, a schematic structural diagram of implementing function replication by a chiplet architecture is shown in fig. 4, and a key function of an SoC is repeatedly implemented to two or more chiplets, so that the number of chips can be flexibly expanded to meet different application requirements, in addition to the advantage of good yield.
In the design scheme of the invention, the functions can be divided into basic necessary functions and flexible and configurable functions according to the function plan of the small chip. The basic necessary functions in each chiplet can be the same, and the flexible configurable function can be configured in no way, in a single way, or in multiple ways according to the requirement.
The design of high-speed inter-chip interconnection needs to be added in the small chip to be used as a high-speed link for interconnection and intercommunication between chips. According to the interconnection quantity of the small chips and the interconnection bandwidth requirement, the interconnection design among the high-speed chips can be one group or multiple groups, and the design is mainly characterized in that the interconnection efficiency among the small chips is consistent with the efficiency grade of an SoC internal bus. The high-speed chip interconnection link requires that the bandwidth is larger than a set threshold, the time delay is smaller than the set threshold, and the power consumption is smaller than the set threshold.
The chiplets with different functions can be reused for different product combinations, and the design emphasis is that the high-speed inter-chip interconnection design needs to be compatible.
The embodiments of the present invention have been described in detail with reference to the drawings, but the present invention is not limited to the above embodiments, and various changes can be made within the knowledge of those skilled in the art without departing from the gist of the present invention. Although the present invention has been described with reference to a preferred embodiment, it should be understood that various changes, substitutions and alterations can be made herein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims (9)

1.一种分布式可拓展的小芯片设计架构,其特征在于:1. a distributed scalable chiplet design architecture, characterized in that: 在分开的晶元上设计相同或不同的功能模块架构,实现的分布式可扩展计算;Design the same or different functional module architectures on separate wafers to achieve distributed and scalable computing; 所述功能模块架构中通过小芯片的架构实现具体设计;In the functional module architecture, the specific design is realized through the architecture of the small chip; 所述小芯片间通过高速芯片互联链路实现互联互通以及协同工作。The small chips realize interconnection and cooperation through high-speed chip interconnection links. 2.如权利要求1所述的一种分布式可拓展的小芯片设计架构,其特征在于:所述分布式可扩展计算中,分布式具体为将一份计算任务由两个或多个分开的小芯片来协同执行完成。2. A distributed scalable chiplet design architecture according to claim 1, wherein: in the distributed scalable computing, the distributed computing task is specifically divided by two or more The chiplets are executed cooperatively. 3.如权利要求1所述的一种分布式可拓展的小芯片设计架构,其特征在于:所述分布式可扩展计算中,可扩展具体为根据应用场景需求,设计增加小芯片数量,不受SoC的芯片尺寸及生产限制。3. A distributed and scalable chiplet design architecture according to claim 1, characterized in that: in the distributed scalable computing, scalability is specifically designed to increase the number of chiplets according to the requirements of application scenarios, and not Limited by the chip size and production of the SoC. 4.如权利要求1所述的一种分布式可拓展的小芯片设计架构,其特征在于:所述功能模块,包括逻辑电路、模拟电路或者內存电路。4 . The distributed and scalable chiplet design architecture of claim 1 , wherein the functional modules include logic circuits, analog circuits or memory circuits. 5 . 5.如权利要求1所述的一种分布式可拓展的小芯片设计架构,其特征在于:所述分开的晶元,包含的晶元数量是两颗或两颗以上。5 . The distributed and scalable chiplet design architecture as claimed in claim 1 , wherein the divided wafers contain two or more wafers. 6 . 6.如权利要求1所述的一种分布式可拓展的小芯片设计架构,其特征在于:所述高速芯片互联链路要求带宽大于设定的阈值、时延小于设定的阈值,且功耗小于设定的阈值。6. A distributed and scalable chiplet design architecture according to claim 1, wherein the high-speed chip interconnection link requires a bandwidth greater than a set threshold, a delay less than a set threshold, and a functional consumption is less than the set threshold. 7.如权利要求1所述的一种分布式可拓展的小芯片设计架构,其特征在于:所述高速芯片互联链路为并行链路或串行链路。7 . The distributed and scalable chiplet design architecture of claim 1 , wherein the high-speed chip interconnect link is a parallel link or a serial link. 8 . 8.如权利要求1所述的一种分布式可拓展的小芯片设计架构,其特征在于:在两个小芯片之间,设置一组或者一组以上的高速芯片互联链路。8 . The distributed and scalable chiplet design architecture according to claim 1 , wherein a group or more than one group of high-speed chip interconnect links are set between two chiplets. 9 . 9.如权利要求6所述的一种分布式可拓展的小芯片设计架构,其特征在于:所述带宽、时延和功耗的参数设定阈值的选取,是将若干分开的小芯片改以集成芯片的方式实现时,将小芯片对应的功能模块间互相传输状态与集成芯片内一般的总线连接实现状态进行对比得出。9. A kind of distributed scalable chiplet design architecture as claimed in claim 6, it is characterized in that: the selection of the parameter setting thresholds of the bandwidth, time delay and power consumption is to change a number of separate chiplets. When implemented in the form of an integrated chip, the mutual transmission state between the functional modules corresponding to the small chip is compared with the general bus connection implementation state in the integrated chip.
CN202011573655.XA 2020-12-25 2020-12-25 Distributed extensible small chip design framework Pending CN112613264A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011573655.XA CN112613264A (en) 2020-12-25 2020-12-25 Distributed extensible small chip design framework

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011573655.XA CN112613264A (en) 2020-12-25 2020-12-25 Distributed extensible small chip design framework

Publications (1)

Publication Number Publication Date
CN112613264A true CN112613264A (en) 2021-04-06

Family

ID=75248101

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011573655.XA Pending CN112613264A (en) 2020-12-25 2020-12-25 Distributed extensible small chip design framework

Country Status (1)

Country Link
CN (1) CN112613264A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115101106A (en) * 2022-06-01 2022-09-23 南京蓝洋智能科技有限公司 A method for expanding the capacity of SoC SRAM
CN115617739A (en) * 2022-09-27 2023-01-17 南京信息工程大学 Chiplet architecture-based chip and control method

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020046324A1 (en) * 2000-06-10 2002-04-18 Barroso Luiz Andre Scalable architecture based on single-chip multiprocessing
US20110235652A1 (en) * 2010-03-25 2011-09-29 International Business Machines Corporation Implementing enhanced link bandwidth in a headless interconnect chip
CN102246304A (en) * 2008-11-04 2011-11-16 全球Oled科技有限责任公司 Device with chiplets and adaptable interconnections
CN102799509A (en) * 2012-07-10 2012-11-28 中国科学技术大学 High-bandwidth extendable complex logic verification system based on double FPGA (Field- Programmable Gate Array) chips
CN103348402A (en) * 2011-02-10 2013-10-09 全球Oled科技有限责任公司 Chiplet display device with serial control
US20140044157A1 (en) * 2011-02-14 2014-02-13 The Regents Of The University Of California Multi-band interconnect for inter-chip and intra-chip communications
US20150261709A1 (en) * 2014-03-14 2015-09-17 Emilio Billi Peripheral component interconnect express (pcie) distributed non- transparent bridging designed for scalability,networking and io sharing enabling the creation of complex architectures.
CN109542817A (en) * 2018-11-09 2019-03-29 中国船舶重工集团公司第七二三研究所 A kind of general ECM set control framework
US20200136906A1 (en) * 2019-04-30 2020-04-30 Francesc Guim Bernat Modular i/o configurations for edge computing using disaggregated chiplets
CN111488308A (en) * 2020-04-17 2020-08-04 苏州浪潮智能科技有限公司 System and method for supporting multi-processor expansion of different architectures
US20200294180A1 (en) * 2019-03-15 2020-09-17 Intel Corporation Enabling product skus based on chiplet configurations
CN111737184A (en) * 2020-05-29 2020-10-02 苏州浪潮智能科技有限公司 An AI server computing unit architecture and implementation method
CN111741601A (en) * 2020-07-09 2020-10-02 复旦大学 A Universal Configurable Active Substrate Circuit Structure

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020046324A1 (en) * 2000-06-10 2002-04-18 Barroso Luiz Andre Scalable architecture based on single-chip multiprocessing
CN102246304A (en) * 2008-11-04 2011-11-16 全球Oled科技有限责任公司 Device with chiplets and adaptable interconnections
US20110235652A1 (en) * 2010-03-25 2011-09-29 International Business Machines Corporation Implementing enhanced link bandwidth in a headless interconnect chip
CN103348402A (en) * 2011-02-10 2013-10-09 全球Oled科技有限责任公司 Chiplet display device with serial control
US20140044157A1 (en) * 2011-02-14 2014-02-13 The Regents Of The University Of California Multi-band interconnect for inter-chip and intra-chip communications
CN102799509A (en) * 2012-07-10 2012-11-28 中国科学技术大学 High-bandwidth extendable complex logic verification system based on double FPGA (Field- Programmable Gate Array) chips
US20150261709A1 (en) * 2014-03-14 2015-09-17 Emilio Billi Peripheral component interconnect express (pcie) distributed non- transparent bridging designed for scalability,networking and io sharing enabling the creation of complex architectures.
CN109542817A (en) * 2018-11-09 2019-03-29 中国船舶重工集团公司第七二三研究所 A kind of general ECM set control framework
US20200294180A1 (en) * 2019-03-15 2020-09-17 Intel Corporation Enabling product skus based on chiplet configurations
US20200136906A1 (en) * 2019-04-30 2020-04-30 Francesc Guim Bernat Modular i/o configurations for edge computing using disaggregated chiplets
CN111488308A (en) * 2020-04-17 2020-08-04 苏州浪潮智能科技有限公司 System and method for supporting multi-processor expansion of different architectures
CN111737184A (en) * 2020-05-29 2020-10-02 苏州浪潮智能科技有限公司 An AI server computing unit architecture and implementation method
CN111741601A (en) * 2020-07-09 2020-10-02 复旦大学 A Universal Configurable Active Substrate Circuit Structure

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
安长俊;周家婕;魏斌;高明;: "基于分布式存储的高性能嵌入式并行处理系统的设计与实现", 电子世界, no. 18, pages 131 - 133 *
董海涛;陈光胜;王晓辉;吴强;: "基于BT5.0的SoC芯片设计技术研究", 集成电路应用, no. 05, pages 5 - 7 *
雷航,王茜编著: "《现代微处理器及总线设计》", vol. 1, 国防工业出版社, pages: 352 - 354 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115101106A (en) * 2022-06-01 2022-09-23 南京蓝洋智能科技有限公司 A method for expanding the capacity of SoC SRAM
CN115617739A (en) * 2022-09-27 2023-01-17 南京信息工程大学 Chiplet architecture-based chip and control method
CN115617739B (en) * 2022-09-27 2024-02-23 南京信息工程大学 Chip based on Chiplet architecture and control method

Similar Documents

Publication Publication Date Title
CN112612748A (en) Super heterogeneous computing method based on extensible small chip architecture
CN112613264A (en) Distributed extensible small chip design framework
US11837503B2 (en) Scalable and flexible architectures for integrated circuit (IC) design and fabrication
US10591544B2 (en) Programmable integrated circuits with in-operation reconfiguration capability
US6266797B1 (en) Data transfer network on a computer chip using a re-configurable path multiple ring topology
US11239203B2 (en) Multi-chip stacked devices
US8189603B2 (en) PCI express to PCI express based low latency interconnect scheme for clustering systems
CN111786894B (en) FPGA device for realizing on-chip network transmission bandwidth expansion function
US10886921B1 (en) Multi-chip stacked devices
US11127718B2 (en) Multi-chip stacked devices
US12027512B2 (en) Chipset and manufacturing method thereof
CN110609805B (en) Method for realizing system-on-chip
US20180024618A1 (en) Power Control in Integrated Circuits
US20050015572A1 (en) Semiconductor integrated circuit
CN110192188B (en) Self-Healing Computing Array
CN113656345B (en) Computing device, computing system and computing method
CN112699077B (en) FPGA chip and interconnection method of FPGA sub-chips
CN112631989A (en) Data transmission method among small chips, among chips and among small chips
US10990307B2 (en) System and method for providing a configurable storage media interface
US12190038B1 (en) Multi-chip module (MCM) with multi-port unified memory
Srinivasa et al. A 300MB SRAM, 20Tb/s Bandwidth Scalable Heterogenous 2.5 D System Inferencing Simultaneous Streams Across 20 Chiplets with Workload-Dependent Configurations
WO2021055038A1 (en) Redundancy scheme for multi-chip stacked devices
CN112732631A (en) Data transmission method between small chips
WO2009155762A1 (en) Array processor structure
TWI854950B (en) Input/output die and system-on-a-chip

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20210406

RJ01 Rejection of invention patent application after publication