Shop Smart, Save More, and Enjoy Unmatched Value on Every Purchase!

Researchers find just 250 malicious documents can leave LLMs vulnerable to backdoors

Artificial intelligence companies have been working at breakneck speeds to develop the best and most powerful tools, but that rapid development hasn’t always been coupled with clear understandings of AI’s limitations or weaknesses. Today, Anthropic released a report on how attackers can influence the development of a large language model.

The study centered on a type of attack called poisoning, where an LLM is pretrained on malicious content intended to make it learn dangerous or unwanted behaviors. The key finding from this study is that a bad actor doesn’t need to control a percentage of the pretraining materials to get the LLM to be poisoned. Instead, the researchers found that a small and fairly constant number of malicious documents can poison an LLM, regardless of the size of the model or its training materials. The study was able to successfully backdoor LLMs based on using only 250 malicious documents in the pretraining data set, a much smaller number than expected for models ranging from 600 million to 13 billion parameters. 

“We’re sharing these findings to show that data-poisoning attacks might be more practical than believed, and to encourage further research on data poisoning and potential defenses against it,” the company said. Anthropic collaborated with the UK AI Security Institute and the Alan Turing Institute on the research.

Trending Products

0
Add to compare
- 20%
Sceptre Curved 24-inch Gaming Monitor 1080p R1500 98% sRGB HDMI x2 VGA Build-in Speakers, VESA Wall Mount Machine Black (C248W-1920RN Series)

Sceptre Curved 24-inch Gaming Monitor 1080p R1500 98% sRGB HDMI x2 VGA Build-in Speakers, VESA Wall Mount Machine Black (C248W-1920RN Series)

Original price was: $99.97.Current price is: $79.97.
0
Add to compare
- 31%
Acer Nitro 27″ WQHD 2560 x 1440 PC Gaming IPS Monitor | AMD FreeSync Premium As much as 180Hz Refresh 0.5ms DCI-P3 95% 1 Show Port 1.2 & 2 HDMI 2.0 XV271U M3bmiiprx,Black

Acer Nitro 27″ WQHD 2560 x 1440 PC Gaming IPS Monitor | AMD FreeSync Premium As much as 180Hz Refresh 0.5ms DCI-P3 95% 1 Show Port 1.2 & 2 HDMI 2.0 XV271U M3bmiiprx,Black

Original price was: $289.99.Current price is: $199.99.
0
Add to compare
- 20%
SAMSUNG 27″ T35F Sequence FHD 1080p Laptop Monitor, 75Hz, IPS Panel, HDMI, VGA (D-Sub), 3-Sided Border-Much less, FreeSync, LF27T350FHNXZA

SAMSUNG 27″ T35F Sequence FHD 1080p Laptop Monitor, 75Hz, IPS Panel, HDMI, VGA (D-Sub), 3-Sided Border-Much less, FreeSync, LF27T350FHNXZA

Original price was: $149.99.Current price is: $119.99.
0
Add to compare
- 7%
HP Notebook Laptop, 15.6″ HD Touchscreen, Intel Core i3-1115G4 Processor, 32GB RAM, 1TB PCIe SSD, Webcam, Type-C, HDMI, SD Card Reader, Wi-Fi, Windows 11 Home, Silver

HP Notebook Laptop, 15.6″ HD Touchscreen, Intel Core i3-1115G4 Processor, 32GB RAM, 1TB PCIe SSD, Webcam, Type-C, HDMI, SD Card Reader, Wi-Fi, Windows 11 Home, Silver

Original price was: $444.92.Current price is: $415.00.
.

We will be happy to hear your thoughts

Leave a reply

ShopSmartToday
Logo
Register New Account
Compare items
  • Total (0)
Compare
0
Shopping cart