Difference between revisions of "User talk:Jungvi"

@@ Line 1: / Line 1: @@
-<!-- On-Device Training Sparse Sub-Tensor Update Scheme Optimization for CNN-based tasks (SA or MA) -->
-= Overview =
-== Status: Available ==
-* Type: Semester Thesis
-* Professor: Prof. Dr. L. Benini
-* Supervisors:
-** Victor Jung (IIS): [mailto:jungvi@iis.ee.ethz.ch jungvi@iis.ee.ethz.ch]
-** Cristian Cioflan (IIS): [mailto:cioflanc@iis.ee.ethz.ch cioflanc@iis.ee.ethz.ch]
-<!-- TODO: ADD APPROPRIATE CATEGORIES HERE -->
-[[Category:2022]]
-[[Category:Semester Thesis]]
-[[Category:Master Thesis]]
-[[Category:Hot]]
-[[Category:Deep Learning Projects]]
-[[Category:Digital]]
-[[Category:Jungvi]]
-[[Category:Cioflanc]]
-[[Category:Available]]
-= Introduction =
-The fast development of the Internet of Things (IoT) comes with a growing need for smart end-node devices able to execute Deep Learning networks locally. Processing the data on-device has many advantages: it drastically reduces latency and communication energy cost, and it is a step towards autonomous IoT end-nodes. Most current research efforts focus on inference, under the train-then-deploy paradigm. However, this results in a device that cannot cope with real-life phenomena such as data distribution shifts or class increments.
-To adapt to these phenomena, several On-Device Training techniques have been proposed in the last few years. However, training on-device still requires a considerable amount of memory, which is a significant challenge for tightly memory-constrained devices such as Microcontrollers (MCUs).
-In this project, we explore new methods to reduce the memory footprint of on-device training by pruning sub-tensors based on their gradients' contribution to the accuracy, and then extrapolate the findings to inference.
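As a rough illustration of the idea above, the following sketch selects which sub-tensors to train under a fixed memory budget. It is a minimal knapsack-style heuristic under stated assumptions, not the selection rule of any cited work: the `SubTensor` record, the `contribution` scores, the `train_bytes` costs and the example values are all hypothetical placeholders.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SubTensor:
    """A candidate slice of a weight tensor (hypothetical bookkeeping record)."""
    name: str            # illustrative label, e.g. "conv2[0:16]"
    contribution: float  # assumed estimate of the accuracy gain if this slice is trained
    train_bytes: int     # assumed extra memory needed to update this slice (must be > 0)


def select_sub_tensors(candidates, budget_bytes):
    """Greedily keep the slices with the best contribution per byte.

    Slices that do not fit into the remaining budget are skipped.
    """
    ranked = sorted(
        candidates,
        key=lambda s: s.contribution / s.train_bytes,
        reverse=True,
    )
    chosen, used = [], 0
    for sub in ranked:
        if used + sub.train_bytes <= budget_bytes:
            chosen.append(sub)
            used += sub.train_bytes
    return chosen, used


# Made-up example values, for illustration only.
candidates = [
    SubTensor("conv1[0:8]", contribution=0.10, train_bytes=4096),
    SubTensor("conv2[0:16]", contribution=0.90, train_bytes=8192),
    SubTensor("conv3[0:16]", contribution=0.40, train_bytes=8192),
    SubTensor("fc[0:10]", contribution=0.60, train_bytes=2048),
]
chosen, used = select_sub_tensors(candidates, budget_bytes=12000)
print([s.name for s in chosen], used)
```

In a real setup the contribution scores would have to be measured, for example by fine-tuning with only one slice enabled; how to estimate them is part of what the project investigates.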
-== Character ==
-* 15% Literature research
-* 50% Sparse Update Implementation
-* 35% Benchmarking
-== Prerequisites ==
-* Experience with Python and PyTorch.
-* Knowledge of Deep Learning.
-= Project Goals =
-The main tasks of this project are:
-<ul>
-<li><p>'''T1: Python implementation and evaluation setup'''</p>
-<p>You will implement the sparse update scheme proposed by Lin et al. [[#ref-linondevice2022|&#91;1&#93;]], evaluating it on MobileNetV2 [[#ref-mobilenetsandler2018|&#91;2&#93;]] at a channel-level granularity on an image classification task [[#ref-deng2009imagenet|&#91;3&#93;]].</p></li>
-<li><p>'''T2: Sparse Scheme Optimizer and benchmarking'''</p>
-<p>You will leverage Evolutionary Search to quickly reach a good sparse update scheme. The type of Evolutionary algorithm and its implementation will be carefully studied so that they fit this particular optimization problem well. The optimizer will then be benchmarked against a random search baseline.</p></li>
-<li><p>'''T3: Sub-tensor granularity and evaluation'''</p>
-<p>Following the optimization and evaluation on a layer level developed in T1-T2, you will increase the granularity to the sub-tensor level. You will evaluate the implementation over different datasets [[#ref-deng2009imagenet|&#91;3&#93;]] [[#ref-Krizhevsky09learningmultiple|&#91;4&#93;]] and/or tasks [[#ref-warden2018|&#91;5&#93;]] [[#ref-mswc2021|&#91;6&#93;]].</p></li>
-<li><p>'''Optional T1: Extend the method to describe inference sparsity'''</p>
-<p>Using the results obtained in T2, you will evaluate whether the gradient contribution to the accuracy can also be used to describe inference sparsity.</p></li></ul>
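The comparison described in T2 can be sketched on a toy problem as follows. This is only an illustration under stated assumptions: the per-layer `GAINS` and `COSTS`, the `BUDGET`, the set of allowed `RATIOS` and the proxy `fitness` function are made-up stand-ins for real training runs, and the evolutionary loop (truncation selection with single-layer mutation) is one simple variant among many, not the algorithm prescribed by the project.

```python
import random

# Toy stand-ins (assumptions): accuracy gain and memory cost of fully updating each layer.
GAINS = [0.5, 1.0, 2.0, 3.0, 1.5, 0.8]
COSTS = [40, 60, 120, 200, 90, 30]   # arbitrary memory units
RATIOS = (0.0, 0.25, 0.5, 1.0)       # fraction of channels updated in a layer
BUDGET = 200


def memory(scheme):
    """Memory used by a scheme (one update ratio per layer)."""
    return sum(c * r for c, r in zip(COSTS, scheme))


def fitness(scheme):
    """Proxy score: total gain if the scheme fits the budget, else -1."""
    if memory(scheme) > BUDGET:
        return -1.0
    return sum(g * r for g, r in zip(GAINS, scheme))


def random_scheme(rng):
    return tuple(rng.choice(RATIOS) for _ in GAINS)


def mutate(scheme, rng):
    """Resample the update ratio of one randomly chosen layer."""
    i = rng.randrange(len(scheme))
    return scheme[:i] + (rng.choice(RATIOS),) + scheme[i + 1:]


def random_search(evaluations, rng):
    best = tuple(0.0 for _ in GAINS)  # "update nothing" is always feasible
    for _ in range(evaluations):
        cand = random_scheme(rng)
        if fitness(cand) > fitness(best):
            best = cand
    return best


def evolutionary_search(generations, pop_size, rng):
    # Start from the feasible "update nothing" scheme plus random schemes.
    population = [tuple(0.0 for _ in GAINS)]
    population += [random_scheme(rng) for _ in range(pop_size - 1)]
    for _ in range(generations):
        # Truncation selection: keep the better half unchanged (elitism) ...
        population.sort(key=fitness, reverse=True)
        parents = population[: pop_size // 2]
        # ... and refill the population with mutated copies of the parents.
        children = [mutate(rng.choice(parents), rng)
                    for _ in range(pop_size - len(parents))]
        population = parents + children
    return max(population, key=fitness)


# Both searches consider the same number of candidate schemes (110 each).
es_best = evolutionary_search(generations=20, pop_size=10, rng=random.Random(0))
rs_best = random_search(evaluations=9 + 20 * 5, rng=random.Random(0))
print("evolutionary:", es_best, fitness(es_best))
print("random:      ", rs_best, fitness(rs_best))
```

Because each real fitness evaluation would involve fine-tuning a network, a fair benchmark should give both methods the same evaluation budget, as done here by counting candidate schemes.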
-= Project Organization =
-== Weekly Meetings ==
-The student shall meet with the advisor(s) every week to discuss any issues or problems that persisted during the previous week and to agree on the next steps. These meetings provide a guaranteed time slot for the mutual exchange of information on how to proceed, to clear up any questions from either side, and to ensure the student’s progress.
-== Report ==
-Documentation is an important and often overlooked aspect of engineering. One final report has to be completed within this project. Any word processing software may be used to write the report; nevertheless, the use of LaTeX with Tgif (see: http://bourbon.usc.edu:8001/tgif/index.html and http://www.dz.ee.ethz.ch/en/information/how-to/drawing-schematics.html) or any other vector drawing software (for block diagrams) is strongly encouraged by the IIS staff.
-=== Final Report ===
-A digital copy of the report, the presentation, the developed software, build script/project files, drawings/illustrations, acquired data, etc. need to be handed in at the end of the project. Note that this task description is part of your report and has to be attached to your final report.
-== Presentation ==
-At the end of the project, the outcome of the thesis will be presented in a 15-minute talk followed by 5 minutes of discussion in front of interested members of the Integrated Systems Laboratory. The presentation is open to the public, so you are welcome to invite interested friends. The exact date will be determined towards the end of the work.
-= References =
-<div id="refs" class="references csl-bib-body">
-<div id="ref-linondevice2022" class="csl-entry">
-<span class="csl-left-margin">&#91;1&#93; </span><span class="csl-right-inline">Lin, Ji and Zhu, Ligeng and Chen, Wei-Ming and Wang, Wei-Chen and Gan, Chuang and Han, Song. <span><span class="nocase">On-Device Training Under 256KB Memory. </span></span> 2022.</span>
-</div>
-<div id="ref-mobilenetsandler2018" class="csl-entry">
-<span class="csl-left-margin">&#91;2&#93; </span><span class="csl-right-inline">Sandler, Mark and Howard, Andrew and Zhu, Menglong and Zhmoginov, Andrey and Chen, Liang-Chieh. <span><span class="nocase">MobileNetV2: Inverted Residuals and Linear Bottlenecks. </span></span> 2018.</span>
-</div>
-<div id="ref-deng2009imagenet" class="csl-entry">
-<span class="csl-left-margin">&#91;3&#93; </span><span class="csl-right-inline">Deng, Jia and Dong, Wei and Socher, Richard and Li, Li-Jia and Li, Kai and Fei-Fei, Li. <span>Imagenet: A large-scale hierarchical image database. </span>2009. </span>
-</div>
-<div id="ref-Krizhevsky09learningmultiple" class="csl-entry">
-<span class="csl-left-margin">&#91;4&#93; </span><span class="csl-right-inline">Krizhevsky, Alex. <span>Learning multiple layers of features from tiny images. </span>2009.</span>
-</div>
-<div id="ref-warden2018" class="csl-entry">
-<span class="csl-left-margin">&#91;5&#93; </span><span class="csl-right-inline">Warden, Pete. <span>Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition. </span>2018.</span>
-</div>
-<div id="ref-mswc2021" class="csl-entry">
-<span class="csl-left-margin">&#91;6&#93; </span><span class="csl-right-inline">Mazumder, Mark and Chitlangia, Sharad and Banbury, Colby and Kang, Yiping and Ciro, Juan and Achorn, Keith and Galvez, Daniel and Sabini, Mark and Mattson, Peter and Kanter, David and Diamos, Greg and Warden, Pete and Meyer, Josh and Janapa Reddi, Vijay. <span>Multilingual Spoken Words Corpus. </span>2021.</span>
-</div>
-</div>

From iis-projects

Latest revision as of 12:11, 14 September 2022
