Parallel convolutional processing using an integrated photonic tensor core

  • 0Institute of Physics, University of Münster, Münster, Germany.

|

|

Summary

This summary is machine-generated.

Researchers developed a photonic tensor core, an optical hardware accelerator, performing trillions of multiply-accumulate operations per second. This integrated photonic device offers a path towards faster, scalable AI hardware for data-intensive applications.

Area Of Science

  • Integrated Photonics
  • Optical Computing
  • Artificial Intelligence Hardware

Background

  • Exponential data growth from mobile networks, IoT, and AI necessitates faster, efficient hardware.
  • Current hardware limitations in speed and scalability for processing massive datasets.
  • Need for specialized hardware accelerators for computationally intensive AI tasks.

Purpose Of The Study

  • To demonstrate a computationally specific integrated photonic hardware accelerator (tensor core).
  • To achieve high-speed, parallelized in-memory computing using photonic technologies.
  • To explore the potential of integrated photonics for future AI hardware.

Main Methods

  • Developed a photonic tensor core utilizing phase-change-material memory arrays.
  • Employed photonic chip-based optical frequency combs (soliton microcombs) for computation.
  • Reduced computation to measuring optical transmission through reconfigurable passive components.

Main Results

  • Achieved operational speeds of trillions of multiply-accumulate operations per second (tera-MACs/s).
  • Demonstrated computation bandwidth exceeding 14 gigahertz, limited by modulator and photodetector speeds.
  • Showcased a pathway towards CMOS wafer-scale integration of the photonic tensor core.

Conclusions

  • The photonic tensor core represents a significant advancement in optical computing hardware.
  • Integrated photonics offers a promising solution for parallel, fast, and efficient AI computation.
  • The technology has potential applications in autonomous driving, live video processing, and cloud computing.

Related Concept Videos

Convolution Properties II 01:17

448

The important convolution properties include width, area, differentiation, and integration properties.
The width property indicates that if the durations of input signals are T1 and T2, then the width of the output response equals the sum of both durations, irrespective of the shapes of the two functions. For instance, convolving two rectangular pulses with durations of 2 seconds and 1 second results in a function with a width of 3 seconds.
The area property asserts that the area under the...

Convolution Properties I 01:20

379

Convolution computations can be simplified by utilizing their inherent properties.
The commutative property reveals that the input and the impulse response of an LTI (Linear Time-Invariant) system can be interchanged without affecting the output:

The associative property suggests that the merged convolution of three functions remains unchanged regardless of the sequence of convolution. For instance, for three functions x(t), h(t), and g(t) is written as,

When two LTI systems with impulse...

Parallel Processing 01:20

444

The brain processes sensory information rapidly due to parallel processing, which involves sending data across multiple neural pathways at the same time. This method allows the brain to manage various sensory qualities, such as shapes, colors, movements, and locations, all concurrently. For instance, when observing a forest landscape, the brain simultaneously processes the movement of leaves, the shapes of trees, the depth between them, and the various shades of green. This enables a quick and...

Convolution: Math, Graphics, and Discrete Signals 01:24

673

In any LTI (Linear Time-Invariant) system, the convolution of two signals is denoted using a convolution operator, assuming all initial conditions are zero. The convolution integral can be divided into two parts: the zero-input or natural response and the zero-state or forced response, with t0 indicating the initial time.
To simplify the convolution integral, it is assumed that both the input signal and impulse response are zero for negative time values. The graphical convolution process...

Deconvolution 01:20

418

Deconvolution, also known as inverse filtering, is the process of extracting the impulse response from known input and output signals. This technique is vital in scenarios where the system's characteristics are unknown, and they must be inferred from the observable signals.
Deconvolution involves several mathematical techniques to derive the impulse response. One common approach is polynomial division. In this method, the input and output sequences are treated as coefficients of...

Computed Tomography 01:10

7.6K

Tomography refers to imaging by sections. Computed tomography (CT) is a non-invasive imaging technique that uses computers to analyze several cross-sectional X-rays to reveal minute details about structures in the body.
The technique was invented in the 1970s and is based on the principle that as X-rays pass through the body, they are absorbed or reflected at different levels. In the technique, a patient lies on a motorized platform while a computerized axial tomography (CAT) scanner rotates...