Sample-efficient deep learning for COVID-19 diagnosis based on CT scans

X He, X Yang, S Zhang, J Zhao, Y Zhang, E Xing, P Xie - medrxiv, 2020 - medrxiv.org
Coronavirus disease 2019 (COVID-19) has infected more than 1.3 million individuals all
over the world and caused more than 106,000 deaths. One major hurdle in controlling the …

Covid-ct-dataset: a ct scan dataset about covid-19

J Zhao, Y Zhang, X He, P Xie - 2020 - europepmc.org
CT scans are promising in providing accurate, fast, and cheap screening and testing of
COVID-19. In this paper, we build a publicly available COVID-CT dataset, containing 275 CT …

Training-free structured diffusion guidance for compositional text-to-image synthesis

W Feng, X He, TJ Fu, V Jampani, A Akula… - arXiv preprint arXiv …, 2022 - arxiv.org
Large-scale diffusion models have achieved state-of-the-art results on text-to-image synthesis
(T2I) tasks. Despite their ability to generate high-quality yet creative images, we observe …

COVID-CT-dataset: a CT scan dataset about COVID-19

X Yang, X He, J Zhao, Y Zhang, S Zhang… - arXiv preprint arXiv …, 2020 - arxiv.org
During the outbreak time of COVID-19, computed tomography (CT) is a useful manner for
diagnosing COVID-19 patients. Due to privacy issues, publicly available COVID-19 CT …

Layoutgpt: Compositional visual planning and generation with large language models

…, T Fu, V Jampani, A Akula, X He… - Advances in …, 2024 - proceedings.neurips.cc
Attaining a high degree of user controllability in visual generation often requires intricate,
fine-grained inputs like layouts. However, such inputs impose a substantial burden on users …

[PDF][PDF] Parameter-efficient fine-tuning for vision transformers

X He, C Li, P Zhang, J Yang… - arXiv preprint arXiv …, 2022 - openreview.net
In computer vision, it has achieved great success in adapting large-scale pretrained vision
models (eg, Vision Transformer) to downstream tasks via fine-tuning. Common approaches …

Pathvqa: 30000+ questions for medical visual question answering

X He, Y Zhang, L Mou, E Xing, P Xie - arXiv preprint arXiv:2003.10286, 2020 - arxiv.org
Is it possible to develop an "AI Pathologist" to pass the board-certified examination of the
American Board of Pathology? To achieve this goal, the first step is to create a visual question …

Parameter-efficient model adaptation for vision transformers

X He, C Li, P Zhang, J Yang, XE Wang - Proceedings of the AAAI …, 2023 - ojs.aaai.org
In computer vision, it has achieved great transfer learning performance via adapting large-scale
pretrained vision models (eg, vision transformers) to downstream tasks. Common …

[PDF][PDF] Towards Visual Question Answering on Pathology Images.

X He - Proceedings of the 59th annual meeting of the …, 2021 - par.nsf.gov
Pathology imaging is broadly used for identifying the causes and effects of diseases or
injuries. Given a pathology image, being able to answer questions about the clinical findings …

Minigpt-5: Interleaved vision-and-language generation via generative vokens

K Zheng, X He, XE Wang - arXiv preprint arXiv:2310.02239, 2023 - arxiv.org
Large Language Models (LLMs) have garnered significant attention for their advancements
in natural language processing, demonstrating unparalleled prowess in text comprehension …