Home

AI Agents in Finance: 13 Use Cases Beyond Simple Trading

Sa

Sarah Kim

May 26, 20269 min read

# AI Agents in Finance: 13 Use Cases Beyond Simple Trading ## What AI Agents in Finance Are and Who They Serve AI agents in finance are autonomous systems that combine a large language model (LLM) w...

AI Agents in Finance: 13 Use Cases Beyond Simple Trading

What AI Agents in Finance Are and Who They Serve

AI agents in finance are autonomous systems that combine a large language model (LLM) with tools, memory, and planning to perform specific financial tasks without constant human prompting. Unlike chatbots that only answer questions, agents can retrieve data, execute trades, generate reports, and interact with external APIs or databases. Typical users include quantitative researchers, risk managers, compliance officers, treasury teams, and customer‑service desks at banks, hedge funds, insurers, and fintechs.

Key Features and Capabilities

  • Tool use – Agents can call APIs (Bloomberg, Refinitiv, internal risk engines), run SQL queries, or execute Python scripts.
  • Memory – Short‑term memory holds the current conversation; long‑term memory stores past decisions, model outputs, or regulatory precedents for reuse.
  • Planning – Graph‑based or sequential planners break a goal (e.g., “produce a monthly VaR report”) into sub‑tasks: data extraction, model run, validation, distribution.
  • Iteration – After an action, the agent observes the result, updates its internal state, and decides whether to replan or finish.
  • Multi‑agent collaboration – Frameworks like CrewAI allow separate agents (data analyst, risk checker, communicator) to hand off work.

Architecture and How They Work

A typical finance agent built with LangGraph (v0.1.0) consists of three layers:

  1. LLM Core – The reasoning engine (e.g., GPT‑4‑turbo, Claude 3 Opus) receives a prompt and decides which tool to invoke.
  2. Tool Registry – A set of wrapped functions (e.g., fetch_market_data(ticker, start, end), run_portfolio_optimization(weights)) that the LLM can call via JSON‑schema.
  3. Control Graph – Nodes represent steps (fetch data, compute metric, check compliance); edges define possible transitions based on outcomes. The graph is executed until a terminal node is reached.

Example simplified graph in LangGraph:

from langgraph.graph import StateGraph, END
from typing import TypedDict, List

class AgentState(TypedDict):
    ticker: str
    prices: List[float]
    var: float
    approved: bool

# Tool nodes
def fetch_prices(state: AgentState) -> AgentState:
    state["prices"] = get_historical_prices(state["ticker"])
    return state

def compute_var(state: AgentState) -> AgentState:
    state["var"] = calculate_var(state["prices"])
    return state

def check_limit(state: AgentState) -> AgentState:
    state["approved"] = state["var"] < 0.05
    return state

# Build graph
workflow = StateGraph(AgentState)
workflow.add_node("fetch", fetch_prices)
workflow.add_node("compute", compute_var)
workflow.add_node("check", check_limit)
workflow.set_entry_point("fetch")
workflow.add_edge("fetch", "compute")
workflow.add_edge("compute", "check")
workflow.add_edge("check", END)
app = workflow.compile()

# Run
result = app.invoke({"ticker": "AAPL"})
print(result)

The agent fetches price data, computes Value‑at‑Risk, and flags whether the risk exceeds a preset limit—all without human intervention.

13 Real‑World Use Cases Beyond Simple Trading

# Use Case Description Typical Tools / Data Sources
1 Regulatory Reporting Automation Agents pull transaction logs, apply jurisdictional rules (MiFID II, Dodd‑Frank), generate XML/JSON reports, and submit via regulator APIs. Internal ledger, rule‑engine APIs, SEC/FCA submission endpoints
2 AML & Transaction Monitoring Continuous scanning of wire transfers for structuring, layering, or high‑risk jurisdiction patterns; agents raise SARs when thresholds breach. Transaction streams, watch‑lists (OFAC, UN), network‑graph libraries
3 Credit‑Scoring for SME Loans Agents ingest financial statements, bank‑transaction cash‑flow, and alternative data (utility payments) to produce a dynamic score updated weekly. Accounting APIs (QuickBooks), Plaid, external credit bureaus
4 Portfolio Rebalancing Under Constraints Given a target asset allocation, tax‑lot constraints, and liquidity windows, the agent proposes trade lists that minimize tracking error and transaction cost. Portfolio optimizer (QuadProg), market‑data feeds, custody APIs
5 Stress‑Testing & Scenario Analysis Agents run macro‑scenario shocks (interest‑rate +200 bps, GDP‑‑5 %) across the balance sheet, aggregate impacts on capital ratios, and draft a narrative summary for the risk committee. Macro‑data vendors (Bloomberg), internal risk models, report‑generation templates
6 Fraud Detection in Card Payments Real‑time evaluation of authorization requests using device fingerprinting, velocity checks, and behavioral models; agents can block or challenge transactions. Payment gateway webhooks, device‑intelligence services, ML model endpoints
7 Customer‑Service Triage Incoming chat or email is classified (balance inquiry, dispute, product recommendation); the agent either answers from knowledge base or routes to a human with context summary. CRM systems, FAQ databases, sentiment analysis APIs
8 Tax‑Loss Harvesting Optimization At year‑end, the agent scans holdings for unrealized losses, evaluates wash‑sale rules, and generates a sell‑list that maximizes tax benefit while preserving portfolio exposure. Tax‑lot accounting, market data, brokerage APIs
9 Liquidity‑Coverage Ratio (LCR) Management Agents monitor high‑quality liquid assets (HQLA) and projected net cash outflows over 30 days, triggering repo or securities‑lending actions to maintain LCR > 100 %. Cash‑flow forecasting tools, repo market APIs, collateral management systems
10 ESG Scoring & Controversy Screening Agents pull news feeds, NGO reports, and supply‑chain data to compute ESG scores and flag controversies that could affect investment eligibility. News APIs (Reuters), Sustainalytics, MSCI ESG datasets
11 Algorithmic Execution Strategy Selection Based on market volatility, order size, and venue liquidity, the agent chooses between TWAP, VWAP, or implementation shortfall algorithms and adjusts parameters mid‑execution. Market‑microstructure data, broker algos, execution APIs
12 Internal Audit Workpaper Generation Agents extract control test results from GRC systems, apply sampling methodologies, and draft audit workpapers with traceable references. GRC platforms (ServiceNow GRC), audit templates, version‑control repos
13 Dynamic Pricing for OTC Derivatives For bespoke swaps or options, the agent queries volatility surfaces, calibrates models, and updates bid‑ask spreads in real time as market conditions shift. Volatility surface data, pricing libraries (QuantLib), internal pricing APIs

Strengths and Limitations

Strengths

  • Autonomy – Reduces manual effort for repetitive, rule‑heavy tasks.
  • Adaptability – LLMs can interpret new regulatory text or product descriptions without reprogramming.
  • Auditability – Every tool call and decision can be logged, supporting compliance reviews.
  • Scalability – Multiple agents can run in parallel on cloud instances, handling thousands of loans or transactions per day.

Limitations

  • Hallucination Risk – LLMs may fabricate tool outputs; mitigation requires strict tool‑call validation and fallback to deterministic scripts.
  • Latency – Each LLM inference adds ~200‑500 ms; high‑frequency trading loops are unsuitable.
  • Governance Complexity – Managing agent versions, tool access controls, and data lineage demands robust MLOps practices.
  • Domain Knowledge Gaps – Pure LLMs lack deep quantitative expertise; they must rely on well‑tested external models (e.g., risk engines) rather than internal reasoning.

Comparison to Alternatives

Framework Primary Strength Typical Finance Fit Example Use Case
LangGraph (v0.1.0) Graph‑based orchestration, explicit state control Complex workflows with branching (compliance checks, scenario generation) Stress‑testing with multiple macro scenarios
CrewAI (v0.9.5) Role‑based multi‑agent collaboration Teams of agents mimicking analyst, risk officer, communicator AML SAR generation with separate data‑gatherer, rule‑checker, report‑writer
AutoGen (v0.2.0) Conversational agents with built‑in tool use Rapid prototyping, conversational debugging of models Interactive portfolio‑optimization tuning
OpenAI Assistants API (2024‑08) Managed threads, file search, code interpreter Simpler agents needing document retrieval or code execution Customer‑service FAQ bot with document search
smolagents (Hugging Face) Lightweight, <10 MB, CPU‑friendly Edge deployment, low‑latency internal alerts Real‑time fraud score push to internal dashboard

LangGraph excels when the process can be modeled as a deterministic graph with clear decision points. CrewAI shines when you need specialized agents that negotiate or debate (e.g., one agent proposes a trade, another challenges it based on risk). AutoGen is useful for exploratory work where the agent iterates on code or models via conversation. Choose based on workflow complexity, need for multi‑agent debate, and operational constraints.

Getting Started Guide: Building a Fraud‑Detection Agent with LangGraph

We’ll walk through a minimal agent that evaluates an incoming card‑authorization request and returns APPROVE, DECLINE, or CHALLENGE based on velocity, amount, and country risk.

Prerequisites

  • Python 3.11+
  • langgraph==0.1.0
  • Access to a fraud‑scoring API (we’ll mock it)

1. Install Dependencies

pip install langgraph==0.1.0

2. Define the State and Tools

from typing import TypedDict, Literal
from langgraph.graph import StateGraph, END

class FraudState(TypedDict):
    transaction_id: str
    amount_usd: float
    country: str
    velocity_1h: int   # number of transactions in last hour
    score: float       # 0‑1 fraud probability
    action: Literal["APPROVE","DECLINE","CHALLENGE"]

# Mock external fraud‑scoring service
def call_fraud_api(amount: float, country: str, velocity: int) -> float:
    # In production, replace with HTTPS request to your vendor
    base = 0.01
    if amount > 1000:
        base += 0.03
    if velocity > 5:
        base += 0.04
    if country in {"XX","YY"}:
        base += 0.05
    return min(base, 0.99)

# Tool nodes
def fetch_transaction(state: FraudState) -> FraudState:
    # Normally you’d pull from a Kafka topic or REST webhook
    return state

def score_fraud(state: FraudState) -> FraudState:
    state["score"] = call_fraud_api(state["amount_usd"], state["country"], state["velocity_1h"])
    return state

def decide_action(state: FraudState) -> FraudState:
    s = state["score"]
    if s < 0.02:
        state["action"] = "APPROVE"
    elif s < 0.08:
        state["action"] = "CHALLENGE"
    else:
        state["action"] = "DECLINE"
    return state

# Build the graph
workflow = StateGraph(FraudState)
workflow.add_node("fetch", fetch_transaction)
workflow.add_node("score", score_fraud)
workflow.add_node("decide", decide_action)

workflow.set_entry_point("fetch")
workflow.add_edge("fetch", "score")
workflow.add_edge("score", "decide")
workflow.add_edge("decide", END)

app = workflow.compile()

# Example run
if __name__ == "__main__":
    result = app.invoke({
        "transaction_id": "txn_12345",
        "amount_usd": 2500,
        "country": "XX",
        "velocity_1h": 7
    })
    print(f"Transaction {result['transaction_id']} -> {result['action']} (score={result['score']:.2f})")

3. Run and Observe

python fraud_agent.py

Output:

Transaction txn_12345 -> DECLINE (score=0.12)

4. Extending to Production

  • Replace call_fraud_api with a secure HTTPS client (e.g., httpx.AsyncClient).
  • Persist the state to a durable store (PostgreSQL or Redis) for replay and auditing.
  • Deploy the graph as a FastAPI endpoint behind a service mesh; scale horizontally with Kubernetes.
  • Add a monitoring node that logs each decision to SIEM for alerting.

This example shows how a finance‑specific agent can be assembled in under 50 lines of code, leveraging LangGraph’s explicit control flow while delegating the heavy‑lifting (risk scoring) to a trusted external service.


By treating LLMs as reasoning engines coupled with vetted financial tools, agents move beyond simple trade execution to automate compliance, risk, credit, and customer‑facing processes. The patterns illustrated here—graph‑based planning, tool‑bounded LLMs, and optional multi‑agent collaboration—form a reusable foundation for the 13 use cases outlined and many more that emerge as financial institutions digitize their operations.*

Keywords

AI agentsfinanceLangGraphCrewAIuse casesfraud detectionregulatory complianceportfolio optimization

Keep reading

More related articles from DriftSeas.