TITLE:
Object Detection Meets LLMs: Model Fusion for Safety and Security
AUTHORS:
Zeba Mohsin Wase, Vijay K. Madisetti, Arshdeep Bahga
KEYWORDS:
Computer Vision, Large Language Models, Self Driving Vehicles
JOURNAL NAME:
Journal of Software Engineering and Applications, Vol.16 No.12, December 27, 2023
ABSTRACT: This paper proposes a novel model fusion approach that enhances the predictive capabilities of vision and language models by strategically integrating object detection with large language models. We name this multimodal integration approach VOLTRON (Vision Object Linguistic Translation for Responsive Observation and Narration). VOLTRON aims to improve the responses of self-driving vehicles in detecting small objects crossing roads and in identifying merged or narrowed lanes. The models are fused through a single layer that provides LLaMA2 (Large Language Model Meta AI) with object detection probabilities from YOLOv8-n (You Only Look Once), translated into sentences. Experiments on specialized datasets showed accuracy improvements of up to 88.16%. We present the theoretical principles that inform our model fusion approach and detail the techniques and strategies used to merge these two disparate models.
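The abstract describes translating object detection probabilities into sentences before passing them to the language model. A minimal sketch of that translation step is shown below; this is an illustrative assumption, not the authors' implementation, and the class names, sentence template, and helper function name are hypothetical.

```python
# Hypothetical sketch of the detection-to-sentence step described in the
# abstract: rendering (label, confidence) pairs from an object detector
# such as YOLOv8-n as a natural-language sentence an LLM can consume.
# The template and labels are illustrative, not from the paper.

def detections_to_sentence(detections):
    """Render a list of (label, confidence) pairs as one sentence."""
    if not detections:
        return "No objects detected on the road."
    parts = [
        f"a {label} with {conf:.0%} confidence"
        for label, conf in detections
    ]
    return "The camera detects " + ", ".join(parts) + "."

prompt = detections_to_sentence(
    [("pedestrian", 0.91), ("narrow lane marking", 0.78)]
)
# prompt can then be appended to the LLM's input context
```

In such a design, the detector's numeric outputs become text tokens, so the language model can reason about the scene without any change to its vocabulary or tokenizer.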