Vision-Language Models Tutorial

Mitigating Object Hallucination in Large Vision-Language Models via Visual Attention Direct Preference Optimization

Abstract: Large Vision-Language Models (LVLMs) suffer from severe object hallucinations, leading them to frequently generate outputs that do not correspond to the image content, significantly reducing ...

IEEE

Task Planning for a Factory Robot Using Large Language Model

Abstract: In recent years, automation has significantly advanced the automobile manufacturing industry. However, many tasks still involve human intervention, so there is a demand for the development ...

Tech Times

Proactive AI From JD.com Watches Your Camera and Speaks Without Prompting

Open source vision language model JoyAI-VL-Interaction from JD.com watches live video streams and speaks without being ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Mitigating Object Hallucination in Large Vision-Language Models via Visual Attention Direct Preference Optimization

Task Planning for a Factory Robot Using Large Language Model

Proactive AI From JD.com Watches Your Camera and Speaks Without Prompting

Trending now