Model Extraction
Description
Attacker reconstructs a functional copy of the model through query-and-response. Sufficient queries enable training a substitute model with comparable accuracy.
⚠️ Risk Impact
Model extraction is feasible against any API-exposed model. The cost to the attacker is the API call cost; the cost to the defender is loss of the model's competitive moat.
🔍 How EchelonGraph Detects This
EchelonGraph's Tier 1 Cloud Scanner automatically checks for this condition across all connected cloud accounts. Violations are flagged as high-severity findings with remediation guidance.
🔧 Remediation
Rate-limit and authenticate API calls. Monitor for high-volume systematic queries. Add output noise (prediction-only, no logits). Detect substitute-model training patterns.
💀 Real-World Attack Scenario
Tramèr et al. (2016) demonstrated extraction of BigML and Amazon ML cloud-hosted models with high fidelity using only the prediction API. The extraction cost in API calls was a small fraction of the model's training cost — making it economically attractive for adversaries.
💰 Cost of Non-Compliance
Model extraction = IP loss = avg $1.2M-$4M per case (Wiz AI Threat Report 2024). Hardest to detect; often discovered only when the substitute model surfaces on a competitor's product.
📋 Audit Questions
- 1.What is the rate limit per API key?
- 2.What monitoring exists for high-volume systematic queries?
- 3.Do you expose logits or prediction-only?
- 4.Has any extraction-pattern alert fired in the last 12 months?
🎯 MITRE ATT&CK Mapping
🏗️ Infrastructure as Code Fix
resource "cloudflare_rate_limit" "ai_inference" {
zone_id = var.zone_id
threshold = 1000 # requests per period
period = 3600 # 1 hour
match {
request { url_pattern = "api.example.com/v1/predict" }
}
action { mode = "challenge" }
}⚡ Common Pitfalls
- ⛔No per-principal rate limit
- ⛔Exposing full logits (faster extraction)
- ⛔No detection of query distribution patterns that signal extraction
📈 Business Value
Model-extraction defence preserves the IP value of the trained model — the asset that typically required millions in training infrastructure.
⏱️ Effort Estimate
2-3 weeks for rate-limit + pattern monitoring
EchelonGraph monitors query patterns; alerts on extraction signatures
🔗 Cross-Framework References
Automate MITRE ATLAS AML.T0025 compliance
EchelonGraph continuously monitors this control across all your cloud accounts.
Start Free →