How to Estimate Tokens Burned by Slack/Teams AI Agent Mentions Before Deployment

June 24, 2026 · 8 min read

Calculator and notepad with handwritten budget calculations on a wooden desk

Why Pre-Deployment Estimation Matters

Anthropic's launch of Claude Tag in Slack on June 23, 2026, plus Microsoft's Copilot integration in Teams, makes "AI agent at the chat layer" the default deployment shape for B2B AI tooling. The catch: these integrations have an unusual cost surface. Per-mention costs feel small ($0.10-$1.00) but fan-out across an org makes the monthly bill arrive faster than expected.

Before you flip the deploy switch, you want a defensible estimate. This framework gets you within 15-25% of actual usage on day one and improves rapidly with real data.

Step 1: Estimate Per-Mention Token Anatomy

Every @bot mention in Slack/Teams expands into roughly the same five inputs:

A. Thread/channel context. 5K-50K tokens. Smaller for short threads, larger for long-running channels with rich history.

B. User profile + permission context. 1K-3K tokens. Username, role, channel permissions.

C. Tool descriptions. 2K-10K tokens. The agent's available tools (search, ticket creation, code execution, etc.).

D. Attached files (if any). 0-200K tokens. The biggest variable in the equation. A single attached PDF can dwarf the rest.

E. Output. 500-3K tokens for a typical reply. Higher for tool-call sequences.

Step 2: Compute Median Per-Mention Cost

Sum the bands above and apply your model's pricing. For Claude Sonnet 4.6 ($3 input / $15 output per M tokens), a median mention with no file attachment:

Input: 25K + 2K + 5K = 32K tokens × $3/M = $0.096
Output: 1.5K tokens × $15/M = $0.023
Total: ~$0.12 per median mention

With a median file attachment (~50K tokens added):

Input: 82K × $3/M = $0.25
Output: 1.5K × $15/M = $0.023
Total: ~$0.28 per file-heavy mention

Step 3: Estimate Mention Volume per User

The volume side has more variance than the per-mention side. From observed usage of similar Slack/Teams AI integrations:

Light users (40% of team): 0-2 mentions/day
Moderate users (40% of team): 3-8 mentions/day
Heavy users (15% of team): 10-25 mentions/day
Power users (5% of team): 50+ mentions/day (often async / pipeline-driven)

For a 20-person engineering team, that aggregates to roughly 80-150 mentions/day, or about 2,200-3,300 mentions/month.

Step 4: Account for File Attachment Mix

Across teams that have rolled out Slack/Teams AI agents, the typical mix is 80% no-attachment / 20% with-attachment. Plugging in:

2,640 mentions/month × 80% × $0.12 = $254 (no-attachment)
2,640 mentions/month × 20% × $0.28 = $148 (with-attachment)
Subtotal: $402/month in synchronous mentions

Step 5: Add Async Task Cost

Modern Slack/Teams AI agents support async assignments — "@Claude, look into this and report back." These cost 5-10x a sync mention because the agent runs unsupervised and re-reads context multiple times.

For a 20-person team, expect roughly 1-3 async tasks per day, weighted heavily toward the 5% of power users:

2 async tasks/day × 22 working days × ~$5/task = ~$220/month

Step 6: Add Fan-Out Multiplier

Slack channels have an organic fan-out effect: one user's question triggers others to ask follow-up questions in adjacent threads, each costing a fresh agent run. Empirically, this adds 25-40% to the raw mention count, especially in support-style channels.

Apply a 30% multiplier to the synchronous mention subtotal: $402 × 1.30 = $523/month.

The Defensible Estimate

For a 20-person team rolling out a Slack/Teams AI agent on Sonnet 4.6:

Sync mentions (with fan-out): $523
Async tasks: $220
Total monthly estimate: $700-$800/month

Add 25% buffer for the first month to account for novelty-driven over-usage and you should budget $900-$1,000/month.

Scaling to Larger Teams

The math scales roughly linearly with team size, with one nonlinear effect: fan-out grows faster than headcount because more people generate more cross-thread discussion. For a 100-person team, expect:

Linear scaling baseline: $700-$800 × 5 = $3,500-$4,000/month
Fan-out adjustment: add 15-25% on top = $4,200-$5,000/month

Three Estimation Pitfalls

Pitfall 1: Using launch-week data. Mention volume in week 1 is 2-3x typical because of novelty. Use week 4 data for forecasting, not week 1.

Pitfall 2: Forgetting context length growth. Channels that exist for months accumulate context. By month 3, your average input tokens per mention may be 30-40% higher than month 1. Build truncation into your context-fetching layer or budget for the drift.

Pitfall 3: Not accounting for model upgrades. Most teams plan to "upgrade to Opus for hard tasks." That upgrade typically affects 10-15% of mentions and increases their cost 5x — adding 35-65% to total spend. Factor it in if you anticipate using Opus for any subset.

Build-In Cost Controls Before Launch

Three controls every Slack/Teams AI deployment should ship with on day one:

Per-workspace daily/monthly token cap (set at 1.5x your estimate)
Per-channel cost dashboard (find the heavy users early)
Smart routing: trivial queries to Haiku/Flash, complex ones to Opus

With these in place, a $1,000/month forecast can absorb 30-50% over-budget months without surprises. Without them, the same forecast can blow out 3-5x in the first quarter.

Frequently Asked Questions

How do I estimate the monthly cost of a Slack AI agent before deployment?

For a 20-person team on Sonnet 4.6, expect $700-$1,000/month: ~$523 in sync mentions (with fan-out), ~$220 in async tasks, plus ~25% buffer for novelty-driven first-month over-usage.

What's the typical token consumption per @bot mention in Slack or Teams?

About 32K input + 1.5K output tokens for a median no-attachment mention (~$0.12 on Sonnet 4.6). With a typical file attachment, jumps to 82K input (~$0.28). Async tasks run 5-10x more because the agent runs unsupervised and re-reads context multiple times.

What pitfalls cause Slack AI cost forecasts to miss reality?

Three big ones: using launch-week data (2-3x typical due to novelty), forgetting context length growth (channels accumulate 30-40% more tokens by month 3), and not accounting for occasional Opus upgrades on hard tasks (which can add 35-65% to total spend).

What cost controls should I build before launching a Slack AI agent?

Three controls: per-workspace daily/monthly token cap at 1.5x estimate, per-channel cost dashboard to find heavy users, and smart routing (trivial queries to Haiku/Flash, complex ones to Opus). With these in place, forecasts absorb 30-50% over-budget months without surprises.

Want to calculate exact costs for your project?

Estimate Your AI Coding Costs →Compare Token Pricing →

How to Count Tokens Before You Code: Estimating AI Coding Costs Accurately

You can't budget what you can't measure. Learn how tokens map to code, how to estimate token counts before a project starts, and how to turn that estimate into a real dollar figure.

WeChat Mini Agent Grayscale: When a Super-App Agent Means Per-Conversation Tokens at Scale

Tencent's WeChat is grayscale-testing 'Mini', an agent embedded in the super-app entry point. We break down the per-conversation token economics for developers building on WeChat's agent platform.

Qwen-AgentWorld Open-Sources 'Predict-Then-Act': How Environment Modeling Cuts Wasted Agent Tokens

Alibaba's Qwen-AgentWorld makes environment prediction a first-class training objective. We analyze how predict-then-act agents avoid the token waste of reactive trial-and-error.

← Previous

AI Code Review Cost: Single Reviewer vs Multi-Agent Judge Panel — Which Actually Saves Money?

What Is Predict-Then-Act Agent Architecture? How It Reduces Rollback Token Cost