European Conference on Computer Vision (17th : 2022 : Tel Aviv, Israel)
Computer vision - ECCV 2022 = 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXXVI
Record type:
Bibliographic record - electronic resource : Monograph/item
Title/Author:
Computer vision - ECCV 2022 / edited by Shai Avidan ... [et al.].
Other title:
17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings.
Other author:
Avidan, Shai.
Corporate author:
European Conference on Computer Vision
Publisher:
Cham : Springer Nature Switzerland : Imprint: Springer, 2022.
Physical description:
lvi, 755 p. : ill., digital ; 24 cm.
Contents note:
Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing -- Generative Negative Text Replay for Continual Vision-Language Pretraining -- Video Graph Transformer for Video Question Answering -- Trace Controlled Text to Image Generation -- Video Question Answering with Iterative Video-Text Co-Tokenization -- Rethinking Data Augmentation for Robust Visual Question Answering -- Explicit Image Caption Editing -- Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding -- Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly -- GRIT: Faster and Better Image Captioning Transformer Using Dual Visual Features -- Selective Query-Guided Debiasing for Video Corpus Moment Retrieval -- Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding -- Object-Centric Unsupervised Image Captioning -- Contrastive Vision-Language Pre-training with Limited Resources -- Learning Linguistic Association towards Efficient Text-Video Retrieval -- ASSISTER: Assistive Navigation via Conditional Instruction Generation -- X-DETR: A Versatile Architecture for Instance-Wise Vision-Language Tasks -- Learning Disentanglement with Decoupled Labels for Vision-Language Navigation -- Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input -- Word-Level Fine-Grained Story Visualization -- Unifying Event Detection and Captioning as Sequence Generation via Pre-training -- Multimodal Transformer with Variable-Length Memory for Vision-and-Language Navigation -- Fine-Grained Visual Entailment -- Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds -- New Datasets and Models for Contextual Reasoning in Visual Dialog -- VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection -- Classification-Regression for Chart Comprehension -- AssistQ: Affordance-Centric Question-Driven Task Completion for Egocentric Assistant -- FindIt: Generalized Localization with Natural Language Queries -- UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling -- Scaling Open-Vocabulary Image Segmentation with Image-Level Labels -- The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning -- Speaker-Adaptive Lip Reading with User-Dependent Padding -- TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation -- SemAug: Semantically Meaningful Image Augmentations for Object Detection through Language Grounding -- Referring Object Manipulation of Natural Images with Conditional Classifier-Free Guidance -- NewsStories: Illustrating Articles with Visual Summaries -- Webly Supervised Concept Expansion for General Purpose Vision Models -- FedVLN: Privacy-Preserving Federated Vision-and-Language Navigation -- CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval -- Language-Driven Artistic Style Transfer -- Single-Stream Multi-level Alignment for Vision-Language Pretraining.
Contained By:
Springer Nature eBook
Subject:
Computer vision - Congresses.
Electronic resource:
https://doi.org/10.1007/978-3-031-20059-5
ISBN:
9783031200595
Computer vision - ECCV 2022 [electronic resource] : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXXVI / edited by Shai Avidan ... [et al.]. - Cham : Springer Nature Switzerland : Imprint: Springer, 2022. - lvi, 755 p. : ill., digital ; 24 cm. - (Lecture notes in computer science, 0302-9743 ; 13696).
The 39-volume set, comprising the LNCS books 13661 through 13699, constitutes the refereed proceedings of the 17th European Conference on Computer Vision, ECCV 2022, held in Tel Aviv, Israel, during October 23-27, 2022. The 1645 papers presented in these proceedings were carefully reviewed and selected from a total of 5804 submissions. The papers deal with topics such as computer vision; machine learning; deep neural networks; reinforcement learning; object recognition; image classification; image processing; object detection; semantic segmentation; human pose estimation; 3D reconstruction; stereo vision; computational photography; neural networks; image coding; image reconstruction; object recognition; motion estimation.
ISBN: 9783031200595
Standard No.: 10.1007/978-3-031-20059-5 (doi)
Subjects--Topical Terms: Computer vision--Congresses. [570734]
LC Class. No.: TA1634 / .E87 2022
Dewey Class. No.: 006.37
LDR :04928nmm a2200337 a 4500
001 2304792
003 DE-He213
005 20221028172922.0
006 m d
007 cr nn 008maaau
008 230409s2022 sz s 0 eng d
020 $a 9783031200595 $q (electronic bk.)
020 $a 9783031200588 $q (paper)
024 7 $a 10.1007/978-3-031-20059-5 $2 doi
035 $a 978-3-031-20059-5
040 $a GP $c GP
041 0 $a eng
050 4 $a TA1634 $b .E87 2022
072 7 $a UYQV $2 bicssc
072 7 $a COM012000 $2 bisacsh
072 7 $a UYQV $2 thema
082 0 4 $a 006.37 $2 23
090 $a TA1634 $b .E89 2022
111 2 $a European Conference on Computer Vision $n (17th : $d 2022 : $c Tel Aviv, Israel) $3 3607246
245 1 0 $a Computer vision - ECCV 2022 $h [electronic resource] : $b 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. $n Part XXXVI / $c edited by Shai Avidan ... [et al.].
260 $a Cham : $b Springer Nature Switzerland : $b Imprint: Springer, $c 2022.
300 $a lvi, 755 p. : $b ill., digital ; $c 24 cm.
490 1 $a Lecture notes in computer science, $x 0302-9743 ; $v 13696
505 0 $a Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing -- Generative Negative Text Replay for Continual Vision-Language Pretraining -- Video Graph Transformer for Video Question Answering -- Trace Controlled Text to Image Generation -- Video Question Answering with Iterative Video-Text Co-Tokenization -- Rethinking Data Augmentation for Robust Visual Question Answering -- Explicit Image Caption Editing -- Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding -- Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly -- GRIT: Faster and Better Image Captioning Transformer Using Dual Visual Features -- Selective Query-Guided Debiasing for Video Corpus Moment Retrieval -- Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding -- Object-Centric Unsupervised Image Captioning -- Contrastive Vision-Language Pre-training with Limited Resources -- Learning Linguistic Association towards Efficient Text-Video Retrieval -- ASSISTER: Assistive Navigation via Conditional Instruction Generation -- X-DETR: A Versatile Architecture for Instance-Wise Vision-Language Tasks -- Learning Disentanglement with Decoupled Labels for Vision-Language Navigation -- Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input -- Word-Level Fine-Grained Story Visualization -- Unifying Event Detection and Captioning as Sequence Generation via Pre-training -- Multimodal Transformer with Variable-Length Memory for Vision-and-Language Navigation -- Fine-Grained Visual Entailment -- Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds -- New Datasets and Models for Contextual Reasoning in Visual Dialog -- VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection -- Classification-Regression for Chart Comprehension -- AssistQ: Affordance-Centric Question-Driven Task Completion for Egocentric Assistant -- FindIt: Generalized Localization with Natural Language Queries -- UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling -- Scaling Open-Vocabulary Image Segmentation with Image-Level Labels -- The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning -- Speaker-Adaptive Lip Reading with User-Dependent Padding -- TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation -- SemAug: Semantically Meaningful Image Augmentations for Object Detection through Language Grounding -- Referring Object Manipulation of Natural Images with Conditional Classifier-Free Guidance -- NewsStories: Illustrating Articles with Visual Summaries -- Webly Supervised Concept Expansion for General Purpose Vision Models -- FedVLN: Privacy-Preserving Federated Vision-and-Language Navigation -- CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval -- Language-Driven Artistic Style Transfer -- Single-Stream Multi-level Alignment for Vision-Language Pretraining.
520 $a The 39-volume set, comprising the LNCS books 13661 through 13699, constitutes the refereed proceedings of the 17th European Conference on Computer Vision, ECCV 2022, held in Tel Aviv, Israel, during October 23-27, 2022. The 1645 papers presented in these proceedings were carefully reviewed and selected from a total of 5804 submissions. The papers deal with topics such as computer vision; machine learning; deep neural networks; reinforcement learning; object recognition; image classification; image processing; object detection; semantic segmentation; human pose estimation; 3D reconstruction; stereo vision; computational photography; neural networks; image coding; image reconstruction; object recognition; motion estimation.
650 0 $a Computer vision $x Congresses. $3 570734
650 0 $a Pattern recognition systems $v Congresses. $3 563039
650 1 4 $a Computer Vision. $3 3538524
650 2 4 $a Computer Engineering and Networks. $3 3538504
650 2 4 $a Automated Pattern Recognition. $3 3538549
650 2 4 $a Natural Language Processing (NLP) $3 3381674
700 1 $a Avidan, Shai. $3 3607247
710 2 $a SpringerLink (Online service) $3 836513
773 0 $t Springer Nature eBook
830 0 $a Lecture notes in computer science ; $v 13696. $3 3607303
856 4 0 $u https://doi.org/10.1007/978-3-031-20059-5
950 $a Computer Science (SpringerNature-11645)
Holdings (1 item)
Barcode: W9446341
Location: Electronic resource
Circulation category: 11.線上閱覽_V (online viewing)
Material type: E-book
Call number: EB TA1634 .E87 2022
Use type: Normal
Loan status: On shelf
Holds: 0