ENN: a neural network with DCT adaptive activation functions

dc.contributor
Universitat Politècnica de Catalunya. Doctorat en Teoria del Senyal i Comunicacions
dc.contributor
Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.contributor.author
Martínez Gost, Marc
dc.contributor.author
Pérez Neira, Ana Isabel
dc.contributor.author
Lagunas Hernandez, Miguel A.
dc.date.issued
2024-03
dc.identifier
M. Gost, M.; Perez Neira, A.; Lagunas, M. ENN: a neural network with DCT adaptive activation functions. "IEEE journal of selected topics in signal processing", Març 2024, vol. 18, núm. 2, p. 232-241.
dc.identifier
1941-0484
dc.identifier
https://hdl.handle.net/2117/405515
dc.identifier
10.1109/JSTSP.2024.3361154
dc.description.abstract
The expressiveness of neural networks highly depends on the nature of the activation function, although these are usually assumed predefined and fixed during the training stage. Under a signal processing perspective, in this paper we present Expressive Neural Network (ENN), a novel model in which the non-linear activation functions are modeled using the Discrete Cosine Transform (DCT) and adapted using backpropagation during training. This parametrization keeps the number of trainable parameters low, is appropriate for gradient-based schemes, and adapts to different learning tasks. This is the first non-linear model for activation functions that relies on a signal processing perspective, providing high flexibility and expressiveness to the network. We contribute with insights in the explainability of the network at convergence by recovering the concept of bump, this is, the response of each activation function in the output space. Finally, through exhaustive experiments we show that the model can adapt to classification and regression tasks. The performance of ENN outperforms state of the art benchmarks, providing above a 40% gap in accuracy in some scenarios.
dc.description.abstract
This work is part of the project IRENE (PID2020-115323RB-C31), funded by MCIN/AEI/10.13039/501100011033.
dc.description.abstract
Peer Reviewed
dc.description.abstract
Postprint (published version)
dc.format
10 p.
dc.format
application/pdf
dc.language
eng
dc.publisher
Institute of Electrical and Electronics Engineers (IEEE)
dc.relation
https://ieeexplore.ieee.org/document/10418453
dc.rights
http://creativecommons.org/licenses/by-nc-nd/4.0/
dc.rights
Open Access
dc.rights
Attribution-NonCommercial-NoDerivatives 4.0 International
dc.subject
Àrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal
dc.subject
Signal processing
dc.subject
Neural networks (Computer science)
dc.subject
Machine learning
dc.subject
Adaptive activation functions
dc.subject
Discrete cosine transform
dc.subject
Explainable machine learning
dc.subject
Tractament del senyal
dc.subject
Xarxes neuronals (Informàtica)
dc.subject
Aprenentatge automàtic
dc.title
ENN: a neural network with DCT adaptive activation functions
dc.type
Article


Ficheros en el ítem

FicherosTamañoFormatoVer

No hay ficheros asociados a este ítem.

Este ítem aparece en la(s) siguiente(s) colección(ones)

E-prints [72954]