Record Detail

Text

Studying the Evolution of Neural Activation Patterns During Training of Feed-Forward ReLU Networks

David Hartmann - Personal Name
Daniel Franzen - Personal Name
Sebastian Brodehl - Personal Name
Zahra Ahmadi - Personal Name

The ability of deep neural networks to form powerful emergent representations of complex statistical patterns in data is as remarkable as it is imperfectly understood. For deep ReLU networks, these representations are encoded in the mixed discrete–continuous structure of linear weight matrices and non-linear binary activations. Our article develops a new technique for instrumenting such networks to efficiently record activation statistics, such as information content (entropy) and similarity of patterns, in real-world training runs. We then study the evolution of activation patterns during training for networks of different architectures using different training and initialization strategies. As a result, we see characteristic general as well as architecture-related behavioral patterns: in particular, most architectures form bottom-up structure, with the exception of highly tuned state-of-the-art architectures and methods (PyramidNet and FixUp), where layers appear to converge more simultaneously. We also observe intermediate dips in entropy in conventional CNNs that are not visible in residual networks. A reference implementation is provided under a free license.
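To make the activation statistics named in the abstract concrete, the following is a minimal sketch and not the authors' reference implementation. It assumes that an "activation pattern" is the binary on/off vector of one dense ReLU layer for one input, that entropy means the empirical Shannon entropy of those patterns over a batch, and that similarity means the mean pairwise Hamming agreement. All function names, the toy network sizes, and the random weights are hypothetical.

```python
import math
import random
from collections import Counter


def relu_pattern(x, weights, biases):
    """Apply one dense ReLU layer; return (outputs, binary on/off pattern)."""
    pre = [sum(w * v for w, v in zip(row, x)) + b
           for row, b in zip(weights, biases)]
    pattern = tuple(1 if p > 0 else 0 for p in pre)
    outputs = [max(p, 0.0) for p in pre]
    return outputs, pattern


def record_layer_patterns(inputs, layers):
    """Collect the binary pattern of every layer for every input."""
    per_layer = [[] for _ in layers]
    for x in inputs:
        h = x
        for k, (weights, biases) in enumerate(layers):
            h, pattern = relu_pattern(h, weights, biases)
            per_layer[k].append(pattern)
    return per_layer


def pattern_entropy(patterns):
    """Empirical Shannon entropy, in bits, of the observed patterns."""
    counts = Counter(patterns)
    n = len(patterns)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def mean_hamming_similarity(patterns):
    """Average fraction of units that agree, over all pairs of patterns."""
    total, pairs = 0.0, 0
    for i in range(len(patterns)):
        for j in range(i + 1, len(patterns)):
            same = sum(a == b for a, b in zip(patterns[i], patterns[j]))
            total += same / len(patterns[i])
            pairs += 1
    return total / pairs if pairs else 1.0


if __name__ == "__main__":
    # Hypothetical toy setup: 4 inputs -> 6 hidden units -> 3 hidden units.
    random.seed(0)
    sizes = [4, 6, 3]
    layers = [
        ([[random.gauss(0, 1) for _ in range(n_in)] for _ in range(n_out)],
         [0.0] * n_out)
        for n_in, n_out in zip(sizes, sizes[1:])
    ]
    batch = [[random.gauss(0, 1) for _ in range(sizes[0])] for _ in range(64)]
    for k, patterns in enumerate(record_layer_patterns(batch, layers)):
        print(f"layer {k}: entropy={pattern_entropy(patterns):.3f} bits, "
              f"similarity={mean_hamming_similarity(patterns):.3f}")
```

Calling `record_layer_patterns` on the same batch at successive training checkpoints would give one entropy value per layer per checkpoint, which is the kind of curve the abstract describes. The pairwise similarity here is quadratic in batch size, so it only suits small batches.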

Availability

No copy data

Detail Information

Series Title	-
Call Number	-
Publisher	Frontiers in Artificial Intelligence : Switzerland, 2021
Collation	006
Language	English
ISBN/ISSN	2624-8212
Classification	NONE
Content Type	-

Media Type	-
Carrier Type	-
Edition	-
Subject(s)	ReLU; activation patterns; neural activations; feed-forward networks; activation entropy
Specific Detail Info	-
Statement of Responsibility	-

Other Information

Accreditation	Scopus Q3

Other version/related

No other version available
