feat(ai): add CASTS for GeaFlow reasoning ability #737

Appointat · 2026-02-02T05:39:56Z

Summary

Introduce the CASTS operator under geaflow-ai/src/operator/casts with a complete Python package layout (core models/config/schema/gremlin state, strategy cache, data sources, services, simulation engine, utils).
Add LLM‑driven reasoning flow: canonical signature storage with abstract matching, simplePath cycle prevention, LLM‑based path evaluation, and starting‑node type recommendations.
Provide a full test suite for lifecycle, state machine, signatures, simplePath, starting node selection, and threshold calculation, plus package config and local tooling.

How was this PR tested?

Tests have Added for the changes
Production environment verified

…ration settings

… Improved Decision Validation

…or structural signatures

…ss multiple files

…ality control Add native Gremlin simplePath() support to prevent pathological cycles in graph traversals. The implementation uses LLM-guided decision-making and AIMD confidence penalties rather than hard-coded restrictions, staying true to the system's learning philosophy. Key changes: - Add simplePath() step to Gremlin state machine for V, E, and P states - Implement per-request path history tracking in TraversalExecutor - Add cycle detection with configurable threshold and penalty modes - Enhance LLM Oracle prompts to recommend simplePath() for exploration goals - Add recent decision history context to improve LLM decision quality - Update configuration with CYCLE_PENALTY and CYCLE_DETECTION_THRESHOLD settings - Document design rationale and rejected alternatives in architecture.md - Add test case for simple path traversal validation

…ulations - Updated MetricsCollector to use Optional types for match_type, parent_node, parent_step_index, edge_label, sku_id, and decision parameters. - Enhanced EVALUATOR documentation to clarify evaluation phases and scoring mechanisms, including coverage rewards and penalties for cache misses. - Modified test cases in test_execution_lifecycle.py to align with new metrics structure and added tests for simple path execution. - Improved test coverage in test_gremlin_step_state_machine.py and test_lifecycle_integration.py to validate state transitions and integration with Gremlin state machine. - Refined threshold calculation tests to ensure monotonicity and boundary conditions. - Added dynamic execution environment constraints in documentation to clarify step legality in relation to current state and schema.

…inability

…n generic types

…proved clarity

Leomrlin · 2026-02-10T11:41:26Z

geaflow-ai/src/operator/casts/casts/core/config.py

+    # EMBEDDING SERVICE CONFIGURATION
+    # ============================================
+    EMBEDDING_ENDPOINT = os.environ.get("EMBEDDING_ENDPOINT", "")
+    EMBEDDING_APIKEY = os.environ.get("EMBEDDING_APIKEY", "YOUR_EMBEDDING_API_KEY_HERE")


It's recommended to remove default values and provide explicit error messages when required values are empty.

Leomrlin · 2026-02-10T11:42:29Z

geaflow-ai/src/operator/casts/casts/core/config.py

@@ -0,0 +1,210 @@
+"""Configuration management for CASTS system.


It's better not to mix with Java code; place CASTS in a dedicated folder, such as geaflow-ai/casts, and why does casts have two nested layers with the same name?

Leomrlin · 2026-02-10T11:43:51Z

geaflow-ai/src/operator/casts/casts/core/__init__.py

All files in the project must start with an Apache License header, even if they are blank.

Leomrlin · 2026-02-10T11:44:22Z

geaflow-ai/src/operator/casts/casts/data/__init__.py

It is recommended to add a README in the root directory that briefly explains the origin of the CASTS module name, design goals, functional division of major components, code execution examples, or a demo, etc.

Leomrlin · 2026-02-10T11:52:25Z

geaflow-ai/src/operator/casts/casts/core/gremlin_state.py

+
+    @staticmethod
+    def get_state_and_options(
+        structural_signature: str, graph_schema: GraphSchema, node_id: str


Does graph_schema support dynamic modifications? The graph_schema is passed in each time the state machine is invoked, but there may be differences.

Leomrlin · 2026-02-10T12:05:39Z

geaflow-ai/src/operator/casts/casts/core/interfaces.py

+        pass
+
+    @abstractmethod
+    def get_valid_outgoing_edge_labels(self, node_id: str) -> list[str]:


In the interface for obtaining outgoing edge types, should we pass in the vertex label instead of the specific entity ID to confine the computation to the metadata level? Passing an ID would involve the process of fetching the entity → obtaining the label → computing the types of adjacent edges.

Leomrlin · 2026-02-10T12:08:45Z

geaflow-ai/src/operator/casts/casts/core/interfaces.py

+
+    @property
+    @abstractmethod
+    def goal_weights(self) -> list[int]:


Is it somewhat limiting to restrict weights to numeric types? Or, should they rather be float numbers?

Leomrlin · 2026-02-10T12:21:12Z

geaflow-ai/src/operator/casts/casts/core/schema.py

+
+        for source_id, out_edges in self._edges.items():
+            if source_id in self._nodes:
+                out_labels = sorted({edge["label"] for edge in out_edges})


Key constants should be declared as constants, including the target parameter below.

Leomrlin · 2026-02-10T12:28:06Z

geaflow-ai/src/operator/casts/casts/core/strategy_cache.py

+
+    def cleanup_low_confidence_skus(self) -> None:
+        """Remove SKUs that have fallen below the minimum confidence threshold."""
+        self.knowledge_base = [sku for sku in self.knowledge_base if sku.confidence_score >= 0.1]


The hardcoded threshold of 0.1 is inconsistent with the min_confidence_threshold defined in config.py

Appointat added 17 commits December 29, 2025 16:49

feat: add CASTS for LLM-Graph based reasoning

536ea57

feat: enhance simulation evaluation with metadata and improve configu…

40cc3ba

…ration settings

feat: enhance LLM Oracle and Simulation Engine with Debug Logging and…

9d4ef40

… Improved Decision Validation

feat(reasoning): implement canonical storage with abstract matching f…

2a685f0

…or structural signatures

chore: update type hints to use List and improve code formatting acro…

ac6b49e

…ss multiple files

feat: enhance LLM Oracle with starting node type recommendations

b62e524

feat(metrics): add rollback_steps method to MetricsCollector

a48cd40

Merge branch 'apache:master' into master

e1aaf2f

refactor: refactor code structure for improved readability and mainta…

2472786

…inability

refactor: move CASTS into geaflow-ai operator

e534be4

reafactor: refactor type hints across multiple modules to use built-i…

569f319

…n generic types

refactor: update type hints for GremlinState and PathEvaluator for im…

53d4457

…proved clarity

refactor: update imports to use StrategyCache from strategy_cache module

f800300

refactor: update module documentation to improve clarity and consistency

e9c94d1

Merge branch 'apache:master' into master

ac210cc

Leomrlin reviewed Feb 10, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(ai): add CASTS for GeaFlow reasoning ability #737

feat(ai): add CASTS for GeaFlow reasoning ability #737

Appointat commented Feb 2, 2026 •

edited

Loading

Uh oh!

Leomrlin Feb 10, 2026

Uh oh!

Leomrlin Feb 10, 2026

Uh oh!

Leomrlin Feb 10, 2026

Uh oh!

Leomrlin Feb 10, 2026

Uh oh!

Leomrlin Feb 10, 2026

Uh oh!

Leomrlin Feb 10, 2026

Uh oh!

Leomrlin Feb 10, 2026

Uh oh!

Leomrlin Feb 10, 2026

Uh oh!

Leomrlin Feb 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		@@ -0,0 +1,210 @@
		"""Configuration management for CASTS system.

feat(ai): add CASTS for GeaFlow reasoning ability #737

Are you sure you want to change the base?

feat(ai): add CASTS for GeaFlow reasoning ability #737

Conversation

Appointat commented Feb 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

How was this PR tested?

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Appointat commented Feb 2, 2026 •

edited

Loading