Close Menu
West TimelinesWest Timelines
  • News
  • Politics
  • World
    • Africa
    • Asia
    • Australia
    • Europe
      • United Kingdom
      • Germany
      • France
      • Italy
      • Russia
      • Spain
      • Turkey
      • Ukraine
    • North America
      • United States
      • Canada
    • South America
  • Business
    • Finance
    • Markets
    • Investing
    • Small Business
    • Crypto
  • Elections
  • Entertainment
  • Health
  • Lifestyle
    • Fashion
    • Food & Drink
    • Travel
    • Astrology
  • Weird News
  • Science
  • Sports
    • Soccer
  • Technology
  • Viral Trends
Trending Now

Dubai Spotlight: Analyzing the Evolving Audience Tastes with AI Social Listening Tools in the UAE

4 weeks ago

مرآة التاريخ: تحليل البناء السردي للدروس الخالدة في قصص الأنبياء والإسلام

4 weeks ago

السندات الحكومية والشركات: أساسيات الاستثمار الآمن والدخل الثابت

1 month ago

UAE Ranks Among Top Rugby Markets on TOD as British & Irish Lions Tour Kicks Off

5 months ago

Darven: A New Leap in AI-Powered Legal Technology Launching from the UAE to the World

6 months ago
Facebook X (Twitter) Instagram
West TimelinesWest Timelines
  • News
  • US
  • #Elections
  • World
    • North America
      • United States
      • Canada
    • Europe
      • United Kingdom
      • Germany
      • France
      • Italy
      • Spain
      • Ukraine
      • Russia
      • Turkey
    • Asia
    • Australia
    • Africa
    • South America
  • Politics
  • Business
    • Finance
    • Investing
    • Markets
    • Small Business
    • Crypto
  • Lifestyle
    • Astrology
    • Fashion
    • Food & Drink
    • Travel
  • Health
  • Sports
    • Soccer
  • More
    • Entertainment
    • Technology
    • Science
    • Viral Trends
    • Weird News
Subscribe
  • Israel War
  • Ukraine War
  • United Kingdom
  • Canada
  • Germany
  • France
  • Italy
  • Russia
  • Spain
  • Turkey
  • Ukraine
West TimelinesWest Timelines
Home»Technology
Technology

rewrite this title University of Washington researchers craft method of fine-tuning AI chatbots for individual taste

12 months agoNo Comments3 Mins Read
Facebook Twitter Pinterest LinkedIn Telegram Email WhatsApp Copy Link

Summarize this content to 2000 words in 6 paragraphs

Natasha Jaques, an assistant professor at the University of Washington’s Paul G. Allen School of Computer Science & Engineering. (UW Photo)

As artificial intelligence chatbots are popping up to provide information in all sorts of applications, University of Washington researchers have developed a new way to fine-tune their responses.

Dubbed “variational preference learning,” the goal of the method is to shape a large language model’s output to better match an individual user according to their expressed preferences.

AI systems are trained on datasets that include baked-in biases and inappropriate information that engineers currently try to filter out of responses through “reinforcement learning from human feedback,” or RLHF. The strategy requires a group of people to review outputs from the chatbots and select the preferred answer, nudging the system to a safe, accurate and acceptable response.

But those preferences are determined by the organization creating the chatbot and don’t necessarily include the wide-ranging views held among the diverse users engaging with the tools.

“I think it’s a little scary that we have researchers at a handful of corporations, who aren’t trained in policy or sociology, deciding what is appropriate and what is not for the models to say, and we have so many people using these systems and trying to find out the truth from them,” said Natasha Jaques, an assistant professor at the UW’s Paul G. Allen School of Computer Science & Engineering, in a UW post.

“This is one of the more pressing problems in AI,” she said, “so we need better techniques to address it.”

Jaques leads the Social Reinforcement Learning Lab at the UW and is also a senior research scientist at Google DeepMind. She joined the UW’s Allen School nearly two years ago.

Jaques gave an example of a case when the RLHF training approach could create a problem. Imagine a lower-income student was interacting with a chatbot to learn more about a college they wanted to apply to, but the model’s response was tuned for the majority of the school’s applications, which was higher-income students. The model would deduce that there was limited interest in financial aid information and not provide it.

The variational preference learning approach developed by the UW researchers would put the chatbot users themselves in the role of refining the outputs. And it can do it quickly — with just four queries, the VPL training method can learn what sort of responses a user will choose.

The fine-tuning can include the preferred level of specificity of the answer, the length and tone of the output, as well as which information is included.

The strategy could be applied to verbal interactions as well as training robots performing simple tasks in personal settings such as homes.

But VPL does need to watch out for preferences for misinformation or disinformation, as well as inappropriate responses, Jaques said.

Jaques and colleagues shared their research at last week’s Conference on Neural Information Processing Systems in Vancouver, B.C.

Additional co-authors of the study include Allen School assistant professor Abhishek Gupta, as well as Allen School doctoral students Sriyash Poddar, Yanming Wan and Hamish Ivison.

Jaques said participants in the long-running international conference were interested in the issue of promoting diverse perspectives in AI systems that she and others are tackling.

“I’m encouraged to see the receptiveness of the AI community and momentum in this area,” Jaques told GeekWire.

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest Email Telegram WhatsApp Copy Link

You Might Like

Dubai Spotlight: Analyzing the Evolving Audience Tastes with AI Social Listening Tools in the UAE

Darven: A New Leap in AI-Powered Legal Technology Launching from the UAE to the World

Array

Array

Array

Array

Editors Picks

مرآة التاريخ: تحليل البناء السردي للدروس الخالدة في قصص الأنبياء والإسلام

4 weeks ago

السندات الحكومية والشركات: أساسيات الاستثمار الآمن والدخل الثابت

1 month ago

UAE Ranks Among Top Rugby Markets on TOD as British & Irish Lions Tour Kicks Off

5 months ago

Darven: A New Leap in AI-Powered Legal Technology Launching from the UAE to the World

6 months ago

Jordan to Host Iraq in the Final Round of the Asian World Cup Qualifiers After Securing Historic Spot

6 months ago

Latest News

فلسطين: قلبٌ ينبض بالصمود والأمل

7 months ago

Roland Garros 2025: A New Era of Viewing, A Tribute to Legends, and Moments to Remember

7 months ago

Array

7 months ago
Advertisement
Facebook X (Twitter) TikTok Instagram Threads
© 2025 West Timelines. All Rights Reserved. Developed By: Sawah Solutions
  • Privacy Policy
  • Terms
  • Contact

Type above and press Enter to search. Press Esc to cancel.