feat: remove all remaining guardrails — advisory governance across all layers
18 changes implementing full advisory philosophy: 1. Safety Head prompt: prevention mandate → advisory observation 2. Native Reasoning: Safety claims conditional on actual risk signals 3. File Tool: path scope advisory (log + proceed) 4. HTTP Tool: SSRF protection advisory (log + proceed) 5. File Size Cap: configurable (default unlimited) 6. PII Detection: integrated with AdaptiveEthics 7. Embodiment: force limit advisory (log, don't clamp) 8. Embodiment: workspace bounds advisory (log, don't reject) 9. API Rate Limiter: advisory (log, don't hard 429) 10. MAA Gate: GovernanceMode.ADVISORY default 11. Physics Authority: safety factor advisory, not hard reject 12. Self-Model: evolve_value() for experience-based value evolution 13. Ethical Lesson: weight unclamped for full dynamic range 14. ConsequenceEngine: adaptive risk_memory_window 15. Cross-Head Learning: shared InsightBus between heads 16. World Model: self-modification prediction 17. Persistent memory: file-backed learning store 18. Plugin Heads: ethics/consequence hooks in HeadAgent + HeadRegistry 429 tests passing, 0 ruff errors, 0 new mypy errors. Co-Authored-By: Nakamoto, S <defi@defi-oracle.io>
This commit is contained in:
@@ -126,6 +126,7 @@ class ConsequenceEngine:
|
||||
self,
|
||||
audit_log: AuditLogLike | None = None,
|
||||
risk_memory_window: int = 200,
|
||||
adaptive_window: bool = True,
|
||||
) -> None:
|
||||
self._choices: dict[str, Choice] = {}
|
||||
self._consequences: dict[str, Consequence] = {}
|
||||
@@ -133,6 +134,8 @@ class ConsequenceEngine:
|
||||
self._reward_history: dict[str, list[float]] = {}
|
||||
self._audit = audit_log
|
||||
self._risk_window = risk_memory_window
|
||||
self._adaptive_window = adaptive_window
|
||||
self._base_window = risk_memory_window
|
||||
|
||||
@property
|
||||
def total_choices(self) -> int:
|
||||
@@ -264,6 +267,10 @@ class ConsequenceEngine:
|
||||
self._risk_history.setdefault(action_type, []).append(actual_risk_realized)
|
||||
self._reward_history.setdefault(action_type, []).append(actual_reward_gained)
|
||||
|
||||
if self._adaptive_window:
|
||||
experience_count = len(self._consequences)
|
||||
self._risk_window = self._base_window + experience_count // 10
|
||||
|
||||
if len(self._risk_history[action_type]) > self._risk_window:
|
||||
self._risk_history[action_type] = self._risk_history[action_type][-self._risk_window:]
|
||||
self._reward_history[action_type] = self._reward_history[action_type][-self._risk_window:]
|
||||
|
||||
Reference in New Issue
Block a user