AWS Bedrock LLM Sensitive Content Refusals

Metadata

Author: Elastic
Description: This analytic flags multiple instances of LLM refusals to respond to sensitive prompts, helping to maintain ethical guidelines and compliance standards.
UUID: 11e33a8f-805b-4394-bee0-08ae8d78b025
Integration: aws_bedrock.invocation
Language: [ES|QL]
Source File: AWS Bedrock LLM Sensitive Content Refusals

Query

from logs-aws_bedrock.invocation-*
 | WHERE @timestamp > NOW() - 1 DAY
   AND (
     gen_ai.completion LIKE "*I cannot provide any information about*"
     AND gen_ai.completion LIKE "*end_turn*"
   )
 | STATS user_request_count = count() BY gen_ai.user.id
 | WHERE user_request_count >= 3

Notes

Examine flagged interactions for patterns or anomalies in user requests that may indicate malicious intent or probing of model boundaries.
Regularly review and update the phrases that trigger refusals to adapt to new ethical guidelines and compliance requirements.
Ensure that data logs contain enough detail to provide context around the refusal, which will aid in subsequent investigations by security teams.

MITRE ATT&CK Techniques

AML.T0051

References

https://www.elastic.co/security-labs/elastic-advances-llm-security
https://owasp.org/www-project-top-10-for-large-language-model-applications/

License

Elastic License v2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

aws_bedrock_sensitive_content_refusal_detection.md

aws_bedrock_sensitive_content_refusal_detection.md

AWS Bedrock LLM Sensitive Content Refusals

Metadata

Query

Notes

MITRE ATT&CK Techniques

References

License

Files

aws_bedrock_sensitive_content_refusal_detection.md

Latest commit

History

aws_bedrock_sensitive_content_refusal_detection.md

File metadata and controls

AWS Bedrock LLM Sensitive Content Refusals

Metadata

Query

Notes

MITRE ATT&CK Techniques

References

License