Pytorch Encoder/Decoder

SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation

We propose an encoder-decoder for open-vocabulary semantic segmentation comprising a hierarchical encoder-based cost map generation and a gradual fusion decoder. We introduce a category early ...

IEEE

MiSiCNet: Minimum Simplex Convolutional Network for Deep Hyperspectral Unmixing

Abstract: In this article, we propose a minimum simplex convolutional network (MiSiCNet) for deep hyperspectral unmixing. Unlike all the deep learning-based unmixing methods proposed in the literature ...

Tech Times

Google Gemma 4 12B Brings Multimodal AI to 16GB Laptops, Free Under Apache 2.0

Google Gemma 4 12B, released June 3, is an open-weight multimodal model that processes text, images, audio, and video in a ...

CNX Software

PCMFlow722 library enables two-way real-time HD voice over ESP-NOW with G.722 audio codec

Tanaka Masayuki's PCMFlow722 library enables (half-duplex) two-way real-time HD voice over ESP-NOW on ESP32 boards with a speaker and a microphone, ...

GitHub

Diffusion Transformers with Representation Autoencoders (RAE)

We present Representation Autoencoders (RAE), a class of autoencoders that utilize pretrained, frozen representation encoders such as DINOv2 and SigLIP2 as encoders with trained ViT decoders. RAE can ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results