A Multimodal Textbook for Vision-Language Pretraining

[Submitted on 1 Jan 2025 (v1), last revised 27 Jan 2025 (this version, v3)] View a PDF of the ...

Zero-Shot Decision Tree Construction via Large Language Models

arXiv:2501.16247v1 Announce Type: cross Abstract: This paper introduces a novel algorithm for constructing decision trees using large language models ...

QuanTaxo: A Quantum Approach to Self-Supervised Taxonomy Expansion

arXiv:2501.14011v1 Announce Type: cross Abstract: A taxonomy is a hierarchical graph containing knowledge to provide valuable insights for various ...

[2501.14719] Do LLMs Provide Consistent Answers to Health-Related Questions across Languages?

[Submitted on 24 Jan 2025] View a PDF of the paper titled Do LLMs Provide Consistent Answers to Health-Related ...

[2501.13833] On the Reasoning Capacity of AI Models and How to Quantify It

[Submitted on 23 Jan 2025] View a PDF of the paper titled On the Reasoning Capacity of AI Models ...

[2412.09460] The Impact of Copyrighted Material on Large Language Models: A Norwegian Perspective

[Submitted on 12 Dec 2024 (v1), last revised 22 Jan 2025 (this version, v3)] Authors: Javier de la Rosa, Vladislav ...

Triplet-Grid Framework for Discontinuous Named Entity Recognition

[Submitted on 4 Nov 2024 (v1), last revised 22 Jan 2025 (this version, v2)] View a PDF of the ...

Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems?


Mixture of Prompts for LLM Task Adaptation

[Submitted on 4 Oct 2023 (v1), last revised 18 Jan 2025 (this version, v3)] View a PDF of the ...

A Grid-Based Benchmark for Evaluating Commonsense Spatial Reasoning

[Submitted on 2 Jul 2024 (v1), last revised 17 Jan 2025 (this version, v2)] View a PDF of the ...