Posts by Tags

Computing in Memory

DynaPlasia

less than 1 minute read

Published:

title: [ISSCC’23] DynaPlasia: An eDRAM In-Memory-Computing-Based Reconfigurable Spatial Accelerator with Triple-Mode Cell

Dynamic Neural Networks

Dynamic Neural Networks A Survey

less than 1 minute read

Published:

Authors: Yizeng Han, Gao Huang, Shiji Song, Le Yang, Honghui Wang, and Yulin Wang

HW-SW co-design

M3ViT

less than 1 minute read

Published:

[NeurIPS’2022] M3ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design

Marionette

less than 1 minute read

Published:

[Micro’23] Towards Efficient Control Flow Handling in Spatial Architecture via Architecting the Control Flow Plane

ICdesign

MLsys

Unity

5 minute read

Published:

title: Unity: Accelerating DNN Training Through Joint Optimization of Algebraic Transformations and Parallelization

MoE

M3ViT

less than 1 minute read

Published:

[NeurIPS’2022] M3ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design

Marionette

less than 1 minute read

Published:

[Micro’23] Towards Efficient Control Flow Handling in Spatial Architecture via Architecting the Control Flow Plane

Reconfigurable Architecture

DynaPlasia

less than 1 minute read

Published:

title: [ISSCC’23] DynaPlasia: An eDRAM In-Memory-Computing-Based Reconfigurable Spatial Accelerator with Triple-Mode Cell

SoC

Source Code

M3ViT

less than 1 minute read

Published:

[NeurIPS’2022] M3ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design

Survey

Dynamic Neural Networks A Survey

less than 1 minute read

Published:

Authors: Yizeng Han, Gao Huang, Shiji Song, Le Yang, Honghui Wang, and Yulin Wang

accelerator

HLS

less than 1 minute read

Published:

HLS example: acc_DTQRD

bit serial architecture

Cambricon-P

4 minute read

Published:

title: Cambricon-P: A Bitflow Architecture for Arbitrary Precision Computing

dataflow

HLS

less than 1 minute read

Published:

HLS example: acc_DTQRD

dataflow opt

Tangram

less than 1 minute read

Published:

Authors: Mingyu Gao, Xuan Yang, Jing Pu, Mark Horowitz, and Christos Kozyrakis

Eyeriss

1 minute read

Published:

Authors: Yu-Hsin Chen, Joel Emer, and Vivienne Sze

DOTA

less than 1 minute read

Published:

Authors: Zheng Qu, Liu Liu, Fengbin Tu, Zhaodong Chen, Yufei Ding, and Yuan Xie

QR Decomposition Acceleration

less than 1 minute read

Published:

Title: Dual-Triangular QR Decomposition with Global Acceleration and Partially Q-Rotation Skipping

future

Future Works

less than 1 minute read

Published:

Ready to read:

  • ViTCoD (HPCA 2023)
  • M3ViT

hardware mapping

Unity

5 minute read

Published:

title: Unity: Accelerating DNN Training Through Joint Optimization of Algebraic Transformations and Parallelization

matrix operation

QR Decomposition Acceleration

less than 1 minute read

Published:

Title: Dual-Triangular QR Decomposition with Global Acceleration and Partially Q-Rotation Skipping

network

Networks of Chiplets

1 minute read

Published:

title: A_Scalable_Methodology_for_Designing_Efficient_Interconnection_Network_of_Chiplets

parallelization

Unity

5 minute read

Published:

title: Unity: Accelerating DNN Training Through Joint Optimization of Algebraic Transformations and Parallelization

scalable accelerator

Networks of Chiplets

1 minute read

Published:

title: A_Scalable_Methodology_for_Designing_Efficient_Interconnection_Network_of_Chiplets

Cambricon-P

4 minute read

Published:

title: Cambricon-P: A Bitflow Architecture for Arbitrary Precision Computing

simulator

EventSimulator

less than 1 minute read

Published:

Sources: Simpy, Gem5, Structural Simulation Toolkit (SST)

TimeLoop

less than 1 minute read

Published:

Sources: NVlabs/Timeloop

spatial accelerator

Tangram

less than 1 minute read

Published:

Authors: Mingyu Gao, Xuan Yang, Jing Pu, Mark Horowitz, and Christos Kozyrakis

Eyeriss

1 minute read

Published:

Authors: Yu-Hsin Chen, Joel Emer, and Vivienne Sze

DOTA

less than 1 minute read

Published:

Authors: Zheng Qu, Liu Liu, Fengbin Tu, Zhaodong Chen, Yufei Ding, and Yuan Xie

QR Decomposition Acceleration

less than 1 minute read

Published:

Title: Dual-Triangular QR Decomposition with Global Acceleration and Partially Q-Rotation Skipping

tapeout

Eyeriss

1 minute read

Published:

Authors: Yu-Hsin Chen, Joel Emer, and Vivienne Sze

teaching

transformer

DOTA

less than 1 minute read

Published:

Authors: Zheng Qu, Liu Liu, Fengbin Tu, Zhaodong Chen, Yufei Ding, and Yuan Xie

verilog

HLS

less than 1 minute read

Published:

HLS example: acc_DTQRD