“Free Hugs” – What to be Cautious of in Hugging Face – Half 2 

Get pleasure from Menace Modeling? Attempt Threats in Fashions! 

Beforehand… 
In half 1 of this 4-part weblog, we mentioned Hugging Face, the possibly harmful belief relationship between Hugging Face customers and the ReadMe file, exploiting customers who belief ReadMe and offered a glimpse into strategies of attacking customers through malicious fashions. 
Partially 2, we discover harmful mannequin protocols extra in-depth– going into the technical causes as to why precisely are fashions working code. 


Introduction to Mannequin Serialization  

A mannequin is a program that was skilled on huge datasets to both acknowledge or generate content material based mostly on statistical conclusions derived from these datasets.  
To oversimplify, they’re simply knowledge outcomes of statistics. Nevertheless, don’t be misled – fashions are code, not plain knowledge. That is usually pressured in every thing ML, notably within the context of safety. With out going into an excessive amount of element – it’s inherent for a lot of fashions to require logic and performance which is customized or particular, relatively than simply statistical knowledge.  
Traditionally (and sadly) that requirement for writable and transmittable logic inspired ML builders to make use of advanced object serialization as a method of mannequin storage – on this case varieties of serialization which might pack code. The quickest answer to this downside is the notoriously harmful pickle, utilized by PyTorch to retailer complete Torch objects, or its extra contextual and fewer risky cousin marshal, utilized by TensorFlow’s lambda layer to retailer lambda code. 

codesnippet1 1

Please cease utilizing this protocol for issues. Please. 

Whereas easy serialization includes knowledge (numbers, strings, bytes, structs), extra advanced serialization can include objects, features and even code – and that considerably raises the danger of one thing malicious lurking contained in the fashions.

Browser warning on Pickle module

Writing’s on the wall there, guys

Defending these harmful deserializers whereas nonetheless utilizing them is sort of a job. For now, let’s concentrate on exploitation. That is fairly properly documented at this level, although there have been some curious downgrades uncovered throughout this analysis. 

Exploiting PyTorch 

PyTorch is a well-liked machine studying library – extraordinarily well-liked on Hugging Face andthe spine of many ML frameworks supported on HF. We’ll have extra on these (and exploit them) in a future weblog. 
PyTorch depends on pickling to save lots of its output, which may include an arbitrary methodology with arbitrary variables invoked upon deserialization with the load operate; this works the identical for PyTorch: 

Import torch commend code snippet

If this seems to be similar to the earlier Pickle instance to you then that’s as a result of it’s. 

Word that the supply code for BadTorch doesn’t must be in scope – the worth of __reduce__ is packed into the pickle, and its contents will execute on any pickle.load motion. 
To fight this, PyTorch added a weights_only flag. This flag detects something exterior of a really small allowlist as malicious and rejects it, severely limiting if not blocking exploitation. It’s used internally by Hugging Face’s transformers, which explains why it may well safely load torches even when harmful and beginning model 2.4 This flag is inspired through a warning the place it’s acknowledged that sooner or later this will probably be a default conduct.  

Hugging face transformers warring text

On the time of writing, PyTorch doesn’t but allow weights_only mode by default. Seeing how the rampant use of torch.load in numerous applied sciences is (this will probably be mentioned partly 3), it will be safer to consider this modification once we see it, as a result of it’s prone to be a breaking change. It could then be as much as the maintainers whose code this modification breaks to both adapt to this modification or disable this safety characteristic. 

TensorFlow to Code Execution 

TensorFlow, is a unique machine studying library that gives numerous methods to serialize objects as properly. 
Of specific curiosity to us are serialized TensorFlow objects in protocols which will include serialized lambda code. Since lambdas are code, they get executed after being unmarshled from Keras’ ﷟HYPERLINK “https://www.tensorflow.org/guide/keras”Keras being a high-level interface library for TensorFlow).. 
Newer variations of TensorFlow don’t generate recordsdata within the older Keras format (TF1, which makes use of a number of protobuf recordsdata or as h5). 
To look at this, we are able to have a look at the older TensorFlow to 2.15.0, which permits producing a mannequin that may be loaded utilizing the malicious code (credit score to Splinter0 for this specific exploit): 

Import tensortflow commend code snippet

Word that the performance to serialize lambdas has been eliminated in later variations of the protocol. For Keras, which helps Lambdas, these are actually counting on annotations to hyperlink lambdas to your individual code, eradicating arbitrary code from the method. 
This might have been an incredible change if it eradicated assist for the outdated harmful codecs, but it surely doesn’t – it solely removes serialization (which creates the payload) however not execution after deserialization (which consumes it). 
Merely put – simply see for your self: should you generate a payload just like the above mannequin in an h5 format utilizing the damaging tensorflow 2.15.0, after which replace your tensorflow: 

Import tensortflow commend code snippet

Exploit created on tensorflow 2.15.0, exploit pops like a champ on 2.18.0

In different phrases – that is nonetheless exploitable. It’s probably not a Keras vulnerability (in the identical vein torch.load “isn’t vulnerable”), although, however relatively it’s a matter of how you find yourself utilizing it – we’ve disclosed it amongst a number of different issues to Hugging Face in August 2024, however extra on that in a later write-up.

SafeTensors

At present, Hugging Face is transferring fashions from a pickle format to SafeTensors, which use a safer deserialization protocol that isn’t as naïve (however not as strong) as pickles.

SafeTensors merely use a totally completely different language (Rust) and a a lot less complicated serialization protocol (Serde), which requires customization for any type of automated conduct post-deserialization.

Transferring from Torch to SafeTensors

Nevertheless, there’s a fly within the SafeTensors ointment – importing. It is smart that the one option to import from one other format is to open it utilizing legacy libraries, but it surely’s additionally one other susceptible option to invoke Torches. convert.py, part of the SafeTensors library supposed to transform torches to the SafeTensors format. Nevertheless, the conversion itself is just a wrapper for torch.load:
https://github.com/huggingface/safetensors/blob/major/bindings/python/convert.py#L186
The HF Devs are conscious of this and have added a immediate – however that may be bypassed with a -y flag:

python convert code result

Mannequin will run whoami on conversion. Disclaimer: picture manipulated to exclude a bunch of passive warnings which may warn you, proper after it’s method too late

The issue right here is the very low belief barrier to cross – since, as mentioned, most configuration is derived from ReadMe instructions. This flag can merely be hidden between different values in directions, which makes convert.py not only a conversion software but in addition one other vector to look out for.

There are various extra conversion scripts within the transformers library that also include harmful calls to torch.load and might be discovered on the Transformers’ Github.

Conclusion

It’s attention-grabbing to see how what’s outdated is new once more. Outdated serialization protocols that are simpler to implement and use, are making a comeback by means of new, advanced know-how – notably when safety was by no means a priority throughout experimentation, and once more changing into deeply ingrained in comparatively new know-how. The value for that velocity remains to be being paid, with all the ecosystem struggling to pivot to a safe and viable service by slugging by means of this tech debt.

There are a number of suggestions to be made when judging fashions by their format:

  • With serialization mechanisms baked into the ecosystem, you must keep away from the legacy ones, and evaluation these which can be middle-of-the-way and traditionally susceptible.
  • Think about a transition to SafeTensor or different protocols which can be recognized as safe and don’t execute code or features on deserialization and reject older probably harmful protocols.
    • BUT by no means belief conversion instruments to securely defuse suspicious fashions (with out reviewing them first).
  • And – as all the time – be sure to belief the maintainer of the Mannequin.

On The Subsequent Episode…  

Now that we’ve mentioned a few susceptible protocols, we’ll display how they are often exploited in follow towards Hugging Face built-in libraries. 

Recent articles

PlushDaemon APT Targets South Korean VPN Supplier in Provide Chain Assault

A beforehand undocumented China-aligned superior persistent menace (APT) group...

Prison IP and OnTheHub Associate to Ship Superior Cybersecurity Options for Training

Torrance, United States / California, January twenty second, 2025,...

Mirai Botnet Launches Report 5.6 Tbps DDoS Assault with 13,000+ IoT System

Jan 22, 2025Ravie LakshmananBotnet / Community Safety Net infrastructure and...

BreachForums Admin Conor Fitzpatrick (Pompompurin) to Be Resentenced

BreachForums admin Conor Fitzpatrick (Pompompurin) faces resentencing after his...