Intro to 경량화

모델 최적화

jnnwnn 2021. 11. 22. 15:38

1. Efficient architecture design; AutoML, Neural Architecture Search

2. Network Pruning; 찾은 모델 줄이기

중요도가 낮은 파라미터를 제거
좋은 중요드를 정의, 찾는 것이 주요 연구 토픽 중 하나 (L2 norm이 크면, loss gradient 크면)
structured/unstructured pruning으로 나뉨
- Structured Pruning: 파라미터를 그룹 단위(channel, filter, layer 등)로 pruning 하는 기법으로 Dense computation에 최적화됨
- Unstructured Pruning: 파라미터를 독립적으로 pruning 하는 기법으로, 네트워크 내부의 행렬이 점차 sparse 해진다. sparse computatoin에 최적화된 소프트웨어 또는 하드웨어에 적합.

3. Knowledge distillation

student network와 gt label의 cross entropy & teacher network와 student network의 Inference 결과에 대한 KLD loss로 구성

4. Matrix/Tensor decomposition

5. Network Quantization

6. Network Compiling