Remove Evaluation Remove Jing Remove Language
article thumbnail

Visual captions: Using large language models to augment video conferences with dynamic visuals

Google Research AI blog

We fine-tuned a large language model to proactively suggest relevant visuals in open-vocabulary conversations using a dataset we curated for this purpose. Visual intent prediction model To predict what visuals could supplement a conversation, we trained a visual intent prediction model based on a large language model using the VC1.5K

article thumbnail

Visual Blocks for ML: Accelerating machine learning prototyping with interactive tools

Google Research AI blog

It usually involves a cross-functional team of ML practitioners who fine-tune the models, evaluate robustness, characterize strengths and weaknesses, inspect performance in the end-use context, and develop the applications. Sign up to be notified when Visual Blocks for ML is publicly available.

professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Google at ICLR 2023

Google Research AI blog

Morcos , Dhruv Batra Offline Q-Learning on Diverse Multi-task Data Both Scales and Generalizes (see blog post ) Aviral Kumar , Rishabh Agarwal , Xingyang Geng , George Tucker , Sergey Levine ReAct: Synergizing Reasoning and Acting in Language Models (see blog post ) Shunyu Yao *, Jeffrey Zhao , Dian Yu , Nan Du , Izhak Shafran , Karthik R.

Google 105
article thumbnail

NpTech Summary: Advocacy 2.0, Sketchcastes, and NpTech in Different Languages

Beth's Blog: How Nonprofits Can Use Social Media

"We can't talk about transparency, accountability and honest evaluation without addressing the contentious topic of failure. Some beta work is happening on this with TechSmith's " Jing Project ," an application that allows you easily embed screencasts into conversations on both PC and MAC platform. What language is this?

Nptech 50
article thumbnail

Google at CVPR 2023

Google Research AI blog

Fleet , Radu Soricut , Jason Baldridge , Mohammad Norouzi , Peter Anderson , William Cha RUST: Latent Neural Scene Representations from Unposed Imagery Mehdi S.

Google 116
article thumbnail

Google at EMNLP 2022

Google Research AI blog

Posted by Malaya Jules, Program Manager, Google This week, the premier conference on Empirical Methods in Natural Language Processing (EMNLP 2022) is being held in Abu Dhabi, United Arab Emirates. We are proud to be a Diamond Sponsor of EMNLP 2022, with Google researchers contributing at all levels. Zhao , Yi Luan , Keith B.

Google 52
article thumbnail

Google at ICML 2023

Google Research AI blog

We build ML systems to solve deep scientific and engineering challenges in areas of language, music, visual processing, algorithm development, and more. Posted by Cat Armato, Program Manager, Google Groups across Google actively pursue research in the field of machine learning (ML), ranging from theory and application.

Google 77