ChengAo Shen's blog

Categories · Paper Reading

Home

About

Archives

loading..
DeepLearning

Reading Notes for “CellPose”

Abstract for this paper: Cell segmentation is important for various downstream tasks. This paper introduces a generalist segmentation method called Cellpose that can segment cells from various ranges of image types without any fine-tuning. They also collect a large dataset containing over 70000 segmented objects to train the model. Finally, they built soft..

Read more
loading..
DeepLearningActionRecognization

Reading Notes for "TimeSformer"

Abstract for this paper: This paper focuses on introducing Transformer architecture to video recognition to replace 3D CNN due to various benefits. They first used a complete formula to derive the method of calculating attention and building a model in video situations. Then, to reduce the computational cost, they try several attention schemes and finally ..

Read more
loading..
DeepLearningActionRecognization

Reading Notes for "TSN"

Abstract for this paper: Video-based tasks such as Action Recognition rely on long-range temporal information. Some methods use LSTM or other RNNs after feature extraction to utilize this information, but it will add more compute costs. Furthermore, the dominant end-to-end CNN model in video-based tasks is still lacking. This paper proposes TSN, which is a..

Read more
loading..
DeepLearningActionRecognization

Reading Notes for "C3D"

Abstract for this paper: In the past, most Action Recognition methods are based on manual features. Although some models use deep learning, they just use 2D ConvNets. This paper explores various 3D convolution kernels having different depths. After that, it proposes a C3D model that can perform well after being trained in a large dataset. The authors use C..

Read more
loading..
DeepLearningActionRecognization

Reading Notes for "Two-Stream CNN"

Abstract for this paper: Action Recognition is an important field in various vision tasks. Before this paper, most works were based on something other than Deep Learning, and although some papers tried using CNN, they couldn’t perform comparably. This paper proposes a Two-Stream ConvNet which introduces optical flow in architecture. This model is trained a..

Read more
loading..
DeepLearningDiffusion

Reading Notes for "InverseSR"

Abstract for this paper: High-resolution magnetic resonance imaging (MRI) scans are important to provide precise information about imaged tissues. Thus, we need to find the method for image super-resolution. Traditional methods based on end-to-end deep learning have to be retrained when the distribution of input shifts. This paper proposed a new method to ..

Read more