From ReLU to GeMU: Activation functions in the lens of cone projection

  • 1. Department of Automation and BNRist, Tsinghua University, Beijing, 100084, China. Electronic address: lijiayun22@mails.tsinghua.edu.cn.
  • 2. Department of Automation and BNRist, Tsinghua University, Beijing, 100084, China. Electronic address: cyx22@mails.tsinghua.edu.cn.
  • 3. Department of Automation and BNRist, Tsinghua University, Beijing, 100084, China. Electronic address: luyw20@mails.tsinghua.edu.cn.
  • 4. Department of Automation and BNRist, Tsinghua University, Beijing, 100084, China. Electronic address: xzf23@mails.tsinghua.edu.cn.
  • 5. Department of Automation and BNRist, Tsinghua University, Beijing, 100084, China. Electronic address: ylmo@tsinghua.edu.cn.
  • 6. Department of Automation and BNRist, Tsinghua University, Beijing, 100084, China. Electronic address: gaohuang@tsinghua.edu.cn.

Abstract

Activation functions are essential for introducing nonlinearity into neural networks, with the Rectified Linear Unit (ReLU) often favored for its simplicity and effectiveness. Motivated by the structural similarity between a single layer of a Feedforward Neural Network (FNN) and a single iteration of the Projected Gradient Descent (PGD) algorithm for constrained optimization problems, we interpret ReLU as the projection from R onto the nonnegative half-line R+. Building on this interpretation, we generalize ReLU to a Generalized Multivariate projection Unit (GeMU), a projection operator onto a convex cone such as the Second-Order Cone (SOC). We prove that the expressive power of FNNs activated by our proposed GeMU is strictly greater than that of FNNs activated by ReLU. Experimental evaluations further corroborate that GeMU is versatile across prevalent architectures and distinct tasks, and that it can outperform various existing activation functions.
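
Since the abstract describes GeMU as a projection onto a convex cone such as the Second-Order Cone, a minimal sketch may help make the geometric picture concrete. The code below is not the authors' implementation: the closed-form SOC projection is standard, but the way pre-activations are grouped into (x, t) blocks and the block size are hypothetical choices made purely for illustration.

# Minimal sketch (not the paper's implementation): activation functions viewed as
# Euclidean projections onto convex cones, following the abstract's description.
# The exact GeMU parameterization (how pre-activations are grouped, any learnable
# parameters) is not given in the abstract; the grouping below is an assumption.
import numpy as np


def relu(z: np.ndarray) -> np.ndarray:
    """ReLU(z) = max(z, 0): elementwise projection of each coordinate onto R+."""
    return np.maximum(z, 0.0)


def soc_projection(x: np.ndarray, t: float) -> tuple:
    """Euclidean projection of (x, t) onto the second-order cone
    K = {(x, t) : ||x||_2 <= t}, using the standard closed-form solution."""
    norm_x = np.linalg.norm(x)
    if norm_x <= t:          # already inside the cone
        return x, t
    if norm_x <= -t:         # inside the polar cone: project to the origin
        return np.zeros_like(x), 0.0
    # otherwise project onto the boundary of the cone
    scale = (norm_x + t) / (2.0 * norm_x)
    return scale * x, scale * norm_x


def gemu(z: np.ndarray, block: int = 4) -> np.ndarray:
    """Hypothetical GeMU-style activation: split the pre-activation vector into
    blocks of size `block`, treat the last entry of each block as the cone
    'height' t and the rest as x, and project each block onto the SOC."""
    assert z.size % block == 0, "feature dimension must be divisible by block size"
    out = np.empty_like(z)
    for i in range(0, z.size, block):
        x, t = soc_projection(z[i:i + block - 1], z[i + block - 1])
        out[i:i + block - 1], out[i + block - 1] = x, t
    return out


if __name__ == "__main__":
    z = np.random.randn(8)
    print("ReLU :", relu(z))
    print("GeMU :", gemu(z, block=4))

In this sketch, setting the block size to 1 makes the cone collapse to the nonnegative half-line R+, and the construction reduces to ordinary ReLU, matching the abstract's view of ReLU as the one-dimensional special case of a cone projection.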
