Sign In to Follow Application
View All Documents & Correspondence

Apparatuses, Methods, And Systems For Instructions Of A Matrix Operationsaccelerator

Abstract: ABSTRACT APPARATUSES, METHODS, AND SYSTEMS FOR INSTRUCTIONS OF A MATRIX OPERATIONS ACCELERATOR Systems, methods, and apparatuses relating to a matrix operations accelerator are described. In one embodiment, a processor includes a matrix operations accelerator circuit that includes a two-dimensional grid of fused multiply accumulate circuits that is switchable from a first mode where a respective output of each of a first proper subset of fused multiply accumulate circuits of the two-dimensional grid is transmitted downstream to a respective input of each of a second proper subset of fused multiply accumulate circuits of the two-dimensional grid to form output values from at least one first input two-dimensional matrix and at least one second input two-dimensional matrix, and store the output values in resultant storage, to a second mode where the respective output of each of the first proper subset of fused multiply accumulate circuits of the two-dimensional grid form first output values from a first subset of the at least one first input two-dimensional matrix and the at least one second input two-dimensional matrix, and store the first output values in the resultant storage, and a respective output of each of the second proper subset of fused multiply accumulate circuits of the two-dimensional grid form second output values from a second subset of the at least one first input two-dimensional matrix and the at least one second input two-dimensional matrix, and store the second output values in the resultant storage.

Get Free WhatsApp Updates!
Notices, Deadlines & Correspondence

Patent Information

Application #
Filing Date
20 June 2020
Publication Number
14/2021
Publication Type
INA
Invention Field
COMPUTER SCIENCE
Status
Email
ipo@iphorizons.com
Parent Application

Applicants

INTEL CORPORATION
2200 Mission College Boulevard, Santa Clara, California 95054, USA

Inventors

1. Alexander HEINECKE
60 Descanso Drive, #3106 San Jose, CA, 95134 USA
2. Kamlesh R. PILLAI
Intel Technology India Pvt. Ltd. No. 136, Old Airport Road, Bangalore Karnataka, India, 560017
3. Christopher J. HUGHES
3543 Druffel PI Santa Clara, CA, 95051 USA

Specification

Claims:WE CLAIM:

1. An apparatus comprising:
a matrix operations accelerator circuit comprising a two-dimensional grid of fused multiply accumulate circuits;
a first plurality of registers that represents at least one first input two-dimensional matrix coupled to the matrix operations accelerator circuit;
a second plurality of registers that represents at least one second input two-dimensional matrix coupled to the matrix operations accelerator circuit;
a decoder, of a core coupled to the matrix operations accelerator circuit, to decode a single instruction into a decoded single instruction, the single instruction including a field that identifies a resultant storage; and
an execution circuit of the core to execute the decoded single instruction to:
switch the matrix operations accelerator circuit from a first mode where a respective output of each of a first proper subset of fused multiply accumulate circuits of the two-dimensional grid is transmitted downstream to a respective input of each of a second proper subset of fused multiply accumulate circuits of the two-dimensional grid to form output values from the at least one first input two-dimensional matrix and the at least one second input two-dimensional matrix, and store the output values in the resultant storage, to a second mode where the respective output of each of the first proper subset of fused multiply accumulate circuits of the two-dimensional grid form first output values from a first subset of the at least one first input two-dimensional matrix and the at least one second input two-dimensional matrix, and store the first output values in the resultant storage, and a respective output of each of the second proper subset of fused multiply accumulate circuits of the two-dimensional grid form second output values from a second subset of the at least one first input two-dimensional matrix and the at least one second input two-dimensional matrix, and store the second output values in the resultant storage.

2. The apparatus of claim 1, wherein the single instruction comprises a second field to indicate the matrix operations accelerator circuit is to execute in the first mode when the second field is a first value and in the second mode when the second field is a second value.

3. The apparatus of claim 2, wherein the second field is an immediate of the single instruction.

4. The apparatus of claim 1, wherein the resultant storage is a third plurality of registers that represents at least one output two-dimensional matrix formed by execution of the decoded single instruction.

5. The apparatus of claim 4, wherein the execution of the decoded single instruction is to:
in the first mode, add values from the third plurality of registers that represents at least one third input two-dimensional matrix initially stored in the third plurality of registers to the output values to form updated output values and store the updated output values, instead of the output values, into the resultant storage, and
in the second mode, add values from the at least one third input two-dimensional matrix initially stored in the third plurality of registers to the first output values and the second output values to form updated first output values and updated second output values and store the updated first output values and updated second output values, instead of the first output values and the second output values, into the resultant storage.

6. The apparatus of claim 1, wherein the execution of the decoded single instruction is to:
in the first mode, add values from at least one third input two-dimensional matrix initially stored in a third plurality of registers to the output values to form updated output values and store the updated output values, instead of the output values, into the resultant storage, and
in the second mode, add values from the at least one third input two-dimensional matrix initially stored in the third plurality of registers to the first output values and the second output values to form updated first output values and updated second output values and store the updated first output values and updated second output values, instead of the first output values and the second output values, into the resultant storage.

7. The apparatus of claim 1, wherein the resultant storage is a third plurality of registers that represents a plurality of output two-dimensional matrices formed by execution of the decoded single instruction.

8. The apparatus of any one of claims 1-7, wherein the first proper subset of fused multiply accumulate circuits is one of a row or a column of the two-dimensional grid of fused multiply accumulate circuits and the second proper subset of fused multiply accumulate circuits is another of the one of the row or the column of the two-dimensional grid of fused multiply accumulate circuits.

9. A method comprising:
decoding, with a decoder of a processor core, a single instruction into a decoded single instruction, wherein the processor core is coupled to a matrix operations accelerator circuit comprising a two-dimensional grid of fused multiply accumulate circuits, the matrix operations accelerator circuit is coupled to a first plurality of registers that represents at least one first input two-dimensional matrix and a second plurality of registers that represents at least one second input two-dimensional matrix, and the single instruction includes a field that identifies a resultant storage; and
executing the decoded single instruction with an execution circuit of the processor core to:
switch the matrix operations accelerator circuit from a first mode where a respective output of each of a first proper subset of fused multiply accumulate circuits of the two-dimensional grid is transmitted downstream to a respective input of each of a second proper subset of fused multiply accumulate circuits of the two-dimensional grid to form output values from the at least one first input two-dimensional matrix and the at least one second input two-dimensional matrix, and store the output values in the resultant storage, to a second mode where the respective output of each of the first proper subset of fused multiply accumulate circuits of the two-dimensional grid form first output values from a first subset of the at least one first input two-dimensional matrix and the at least one second input two-dimensional matrix, and store the first output values in the resultant storage, and a respective output of each of the second proper subset of fused multiply accumulate circuits of the two-dimensional grid form second output values from a second subset of the at least one first input two-dimensional matrix and the at least one second input two-dimensional matrix, and store the second output values in the resultant storage.

10. The method of claim 9, wherein the single instruction comprises a second field indicating that the matrix operations accelerator circuit is to execute in the first mode when the second field is a first value and in the second mode when the second field is a second value.

11. The method of claim 10, wherein the second field is an immediate of the single instruction.

12. The method of claim 9, wherein the resultant storage is a third plurality of registers that represents at least one output two-dimensional matrix formed by execution of the decoded single instruction.

13. The method of claim 12, wherein the executing the decoded single instruction is to:
in the first mode, add values from the third plurality of registers that represents at least one third input two-dimensional matrix initially stored in the third plurality of registers to the output values to form updated output values and store the updated output values, instead of the output values, into the resultant storage, and
in the second mode, add values from the at least one third input two-dimensional matrix initially stored in the third plurality of registers to the first output values and the second output values to form updated first output values and updated second output values and store the updated first output values and updated second output values, instead of the first output values and the second output values, into the resultant storage.

14. The method of claim 9, wherein the executing the decoded single instruction is to:
in the first mode, add values from at least one third input two-dimensional matrix initially stored in a third plurality of registers to the output values to form updated output values and store the updated output values, instead of the output values, into the resultant storage, and
in the second mode, add values from the at least one third input two-dimensional matrix initially stored in the third plurality of registers to the first output values and the second output values to form updated first output values and updated second output values and store the updated first output values and updated second output values, instead of the first output values and the second output values, into the resultant storage.

15. The method of claim 9, wherein the resultant storage is a third plurality of registers that represents a plurality of output two-dimensional matrices formed by execution of the decoded single instruction.

16. The method of any one of claims 9-15, wherein the first proper subset of fused multiply accumulate circuits is one of a row or a column of the two-dimensional grid of fused multiply accumulate circuits and the second proper subset of fused multiply accumulate circuits is another of the one of the row or the column of the two-dimensional grid of fused multiply accumulate circuits.

Documents

Application Documents

# Name Date
1 202044026065-ABSTRACT [18-10-2022(online)].pdf 2022-10-18
1 202044026065-FORM 1 [20-06-2020(online)].pdf 2020-06-20
2 202044026065-CLAIMS [18-10-2022(online)].pdf 2022-10-18
2 202044026065-DRAWINGS [20-06-2020(online)].pdf 2020-06-20
3 202044026065-FER_SER_REPLY [18-10-2022(online)].pdf 2022-10-18
3 202044026065-DECLARATION OF INVENTORSHIP (FORM 5) [20-06-2020(online)].pdf 2020-06-20
4 202044026065-OTHERS [18-10-2022(online)].pdf 2022-10-18
4 202044026065-COMPLETE SPECIFICATION [20-06-2020(online)].pdf 2020-06-20
5 202044026065-Proof of Right [17-10-2022(online)].pdf 2022-10-17
5 202044026065-FORM-26 [14-09-2020(online)].pdf 2020-09-14
6 202044026065-FORM 3 [15-12-2020(online)].pdf 2020-12-15
6 202044026065-FORM 3 [12-10-2022(online)].pdf 2022-10-12
7 202044026065-Information under section 8(2) [12-10-2022(online)].pdf 2022-10-12
7 202044026065-FORM 18 [19-05-2021(online)].pdf 2021-05-19
8 202044026065-FORM 3 [18-06-2021(online)].pdf 2021-06-18
8 202044026065-FER.pdf 2022-04-18
9 202044026065-FER.pdf 2022-04-18
9 202044026065-FORM 3 [18-06-2021(online)].pdf 2021-06-18
10 202044026065-FORM 18 [19-05-2021(online)].pdf 2021-05-19
10 202044026065-Information under section 8(2) [12-10-2022(online)].pdf 2022-10-12
11 202044026065-FORM 3 [12-10-2022(online)].pdf 2022-10-12
11 202044026065-FORM 3 [15-12-2020(online)].pdf 2020-12-15
12 202044026065-FORM-26 [14-09-2020(online)].pdf 2020-09-14
12 202044026065-Proof of Right [17-10-2022(online)].pdf 2022-10-17
13 202044026065-COMPLETE SPECIFICATION [20-06-2020(online)].pdf 2020-06-20
13 202044026065-OTHERS [18-10-2022(online)].pdf 2022-10-18
14 202044026065-DECLARATION OF INVENTORSHIP (FORM 5) [20-06-2020(online)].pdf 2020-06-20
14 202044026065-FER_SER_REPLY [18-10-2022(online)].pdf 2022-10-18
15 202044026065-CLAIMS [18-10-2022(online)].pdf 2022-10-18
15 202044026065-DRAWINGS [20-06-2020(online)].pdf 2020-06-20
16 202044026065-ABSTRACT [18-10-2022(online)].pdf 2022-10-18
16 202044026065-FORM 1 [20-06-2020(online)].pdf 2020-06-20
17 202044026065-US(14)-HearingNotice-(HearingDate-18-11-2025).pdf 2025-10-25
18 202044026065-Correspondence to notify the Controller [27-10-2025(online)].pdf 2025-10-27
19 202044026065-Correspondence to notify the Controller [05-11-2025(online)].pdf 2025-11-05

Search Strategy

1 SearchHistory(67)E_13-04-2022.pdf