A Comparative Study of Neural Network Pruning Strategies for AI Deployment on Edge Devices
Conference proceeding   Peer reviewed

Complex Networks & Their Applications XIII: Proceedings of The Thirteenth International Conference on Complex Networks and Their Applications: COMPLEX NETWORKS 2024 - Volume 1, Vol.1187, pp.3-14
Studies in Computational Intelligence, 1187
Complex Networks & Their Applications 2024 (Istanbul, 10/12/2024–12/12/2024)
2025
Handle:
https://hdl.handle.net/10863/50283

Abstract

Keywords: Data-driven Artificial Intelligence (D2AI); Multi-Layer Perceptron; Sparse Neural Networks; Pruning Techniques; Sparse Evolutionary Training; Graph theory
Artificial neural networks have become crucial across fields such as the Internet of Things (IoT), computer vision, and medicine. Their use in a variety of industrial applications, together with the ever-expanding IoT, has contributed to a growing interest in lightweight neural network models suitable for deployment in environments with limited computational capabilities. Pruning techniques aim to reduce the computational and storage demands of these models. In this work, we investigate the impact that different pruning methods have on Multi-Layer Perceptron (MLP) networks. We compare pre-training, in-training, and post-training pruning with the Sparse Evolutionary Training (SET) method while considering a variety of parameters. We find that highly sparse small-scale MLPs can achieve accuracies similar to those of their fully connected counterparts. Furthermore, energy consumption and inference time are influenced primarily by model size rather than by sparsity level. This research provides insight into optimizing and further understanding neural networks and their applicability to real-world use.
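To make the pruning vocabulary in the abstract concrete, the following is a minimal sketch of two of the ideas it names: one-shot magnitude pruning of a trained layer (a common form of post-training pruning) and a single SET-style prune-and-regrow step. It is not the authors' implementation. The function names, the list-of-lists weight layout, the `zeta` fraction, and the Gaussian regrowth scale are assumptions made for illustration, and the SET step is simplified to rank connections by absolute magnitude.

```python
import random


def magnitude_prune(weights, sparsity):
    """Return a copy of a 2-D weight list with the smallest-magnitude
    fraction `sparsity` of entries set to zero (illustrative helper)."""
    coords = [(i, j) for i, row in enumerate(weights) for j in range(len(row))]
    # Rank every connection by absolute weight, smallest first.
    coords.sort(key=lambda ij: abs(weights[ij[0]][ij[1]]))
    k = int(len(coords) * sparsity)  # number of connections to remove
    pruned = [row[:] for row in weights]
    for i, j in coords[:k]:
        pruned[i][j] = 0.0
    return pruned


def set_prune_and_regrow(weights, zeta, rng):
    """One simplified SET-style topology update (assumption: magnitude
    ranking; regrowth scale 0.1 is arbitrary). Drops the fraction `zeta`
    of active connections closest to zero, then regrows the same number
    at randomly chosen empty positions, so the connection count is kept."""
    active = [(i, j) for i, row in enumerate(weights)
              for j, w in enumerate(row) if w != 0.0]
    active.sort(key=lambda ij: abs(weights[ij[0]][ij[1]]))
    k = int(len(active) * zeta)
    updated = [row[:] for row in weights]
    for i, j in active[:k]:
        updated[i][j] = 0.0
    empty = [(i, j) for i, row in enumerate(updated)
             for j, w in enumerate(row) if w == 0.0]
    for i, j in rng.sample(empty, k):
        updated[i][j] = rng.gauss(0.0, 0.1)  # small random initial weight
    return updated


layer = [[0.5, -0.1, 0.3], [0.05, -0.8, 0.2]]
sparse_layer = magnitude_prune(layer, 0.5)
evolved_layer = set_prune_and_regrow(sparse_layer, 0.5, random.Random(0))
```

In this sketch the magnitude step changes how many connections exist, while the SET-style step keeps the count fixed and only changes where they are, which is the distinction between pruning a dense network and evolving an already sparse one.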
url
https://doi.org/10.1007/978-3-031-82427-2_1
