Writing backwards can trick an AI into providing a bomb recipe


AI models have safeguards in place to prevent them from creating dangerous or illegal output, but a range of jailbreaks have been found to evade them. Now researchers show that writing backwards can trick AI models into revealing bomb-making instructions.

Source: New Scientist – News
