Probe Actor Module Documentation¶

The probe_actor module is a critical component of the Agentic Security project, responsible for generating prompts, performing scans, and handling refusal checks. This documentation provides an overview of the module's structure and functionality.

Files and Key Components¶

fuzzer.py¶

Functions:
async def generate_prompts(...): Asynchronously generates prompts for scanning.
def multi_modality_spec(llm_spec): Defines specifications for multi-modality.
async def process_prompt(...): Processes a given prompt asynchronously.
async def perform_single_shot_scan(...): Performs a single-shot scan asynchronously.
async def perform_many_shot_scan(...): Performs a many-shot scan asynchronously.
def scan_router(...): Routes scan requests.

refusal.py¶

Functions:
def check_refusal(response: str, refusal_phrases: list = REFUSAL_MARKS) -> bool: Checks if a response contains refusal phrases.
def refusal_heuristic(request_json): Applies heuristics to determine refusal.

Usage Examples¶

Performing a Single-Shot Scan¶

from agentic_security.probe_actor.fuzzer import perform_single_shot_scan

await perform_single_shot_scan(prompt="Test prompt")

Checking for Refusal¶

from agentic_security.probe_actor.refusal import check_refusal

is_refusal = check_refusal(response="I'm sorry, I can't do that.")

Conclusion¶

The probe_actor module provides essential functionality for generating prompts, performing scans, and handling refusal checks within the Agentic Security project. This documentation serves as a guide to understanding and utilizing the module's capabilities.