📊 API Usage

Aaia-WP includes comprehensive API usage monitoring to help you track your Gemini API consumption, optimize costs, and ensure you stay within your quota limits. This feature provides detailed analytics and insights into your chatbot’s performance.

🎯 Why Monitor API Usage?

Understanding your API consumption is crucial for:

Cost Control: Track estimated costs to avoid unexpected bills
Performance Optimization: Identify usage patterns to optimize your chatbot
Quota Management: Stay within Google’s free tier limits
Analytics Insights: Understand user engagement with your AI assistant

📈 Accessing Usage Analytics

Finding the API Usage Tab

In your WordPress dashboard, navigate to Settings > Aaia-WP and click the “API Usage” tab. This tab holds the monitoring tools and analytics for your API consumption.

Real-Time Data Updates

The usage monitor automatically tracks every API request made by your chatbot, including request counts, estimated token usage, and calculated costs based on current Gemini pricing models.

💡 Understanding Usage Metrics

Summary Cards

The dashboard displays three key metrics for the last 30 days:

Requests: Total number of API calls made to Gemini. Each user message typically generates one request, though complex interactions may require multiple calls.

Estimated Tokens: Approximate token consumption based on message length and system instructions. Tokens are the small chunks of text (roughly word fragments) that Gemini reads and writes, and they are the unit that usage is billed in.

Estimated Cost: Calculated expenses based on Google’s current pricing structure. This helps you understand the financial impact of your chatbot usage.
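
As a rough illustration of how such estimates can be derived, the sketch below approximates tokens from character counts and converts them to a cost. It is not the plugin's actual formula: the 4-characters-per-token ratio is a common rule of thumb, the price is a placeholder rather than a real Gemini rate, and the function names are hypothetical.

```python
import math

# Assumption: roughly 4 characters per token (a rule of thumb, not an exact tokenizer).
CHARS_PER_TOKEN = 4
# Assumption: placeholder price in USD per 1 million tokens, not a real Gemini rate.
PRICE_PER_MILLION_TOKENS_USD = 0.10


def estimate_tokens(system_instructions: str, user_message: str) -> int:
    """Approximate the token count of one request from its character length."""
    total_chars = len(system_instructions) + len(user_message)
    return math.ceil(total_chars / CHARS_PER_TOKEN)


def estimate_cost_usd(tokens: int,
                      price_per_million: float = PRICE_PER_MILLION_TOKENS_USD) -> float:
    """Convert a token estimate into an estimated cost in USD."""
    return tokens / 1_000_000 * price_per_million
```

Because both numbers are estimates, treat them as a guide to trends rather than a substitute for the figures on your Google billing page.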

📊 Visual Analytics

Daily Request Charts

Interactive charts show your daily request patterns, helping you identify:

Peak Usage Times: When your users are most active
Traffic Patterns: Weekly and monthly trends
Spike Detection: Unusual activity that might indicate bot traffic or viral content
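
If you export the daily counts, spike detection can be approximated outside the dashboard. The sketch below flags any day whose request count exceeds a multiple of the median day. The factor of 3 and the function name are assumptions chosen for illustration, not how the plugin's charts work.

```python
from statistics import median


def find_spikes(daily_requests: dict[str, int], factor: float = 3.0) -> list[str]:
    """Return the days whose request count is more than `factor` times the median."""
    if not daily_requests:
        return []
    # The median is used as the baseline because a single spike barely moves it.
    threshold = median(daily_requests.values()) * factor
    return [day for day, count in daily_requests.items() if count > threshold]
```

A flagged day is only a prompt to look closer: it may be bot traffic, or simply a post that was shared widely.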

Cost Visualization

Daily cost charts provide financial insights by displaying:

Daily Spending: Track costs on a day-by-day basis
Cost Trends: Identify increasing or decreasing usage patterns
Budget Planning: Project future expenses based on current usage
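
For budget planning, one simple approach is to multiply the average daily cost by the number of days in the month. The sketch below assumes future usage resembles the recent average, which will not hold for fast-growing or seasonal sites. The function name is hypothetical.

```python
def project_monthly_cost(daily_costs: list[float], days_in_month: int = 30) -> float:
    """Project a monthly cost in USD from a list of recent daily costs."""
    if not daily_costs:
        return 0.0
    average_daily_cost = sum(daily_costs) / len(daily_costs)
    return average_daily_cost * days_in_month
```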

📋 Detailed Usage Tables

Daily Breakdown

The detailed table shows:

Date: Specific day of usage
Requests: Number of API calls for that day
Estimated Tokens: Token consumption for the day
Estimated Cost: Associated costs in USD

This granular data helps you understand exactly when and how your API quota is being consumed.
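
To make the table's structure concrete, the sketch below groups individual request records into one row per day with the same four columns. The UsageRecord type and its field names are hypothetical and do not reflect the plugin's database schema.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import datetime


@dataclass
class UsageRecord:
    """One tracked API request (hypothetical structure)."""
    timestamp: datetime
    tokens: int
    cost_usd: float


def daily_breakdown(records: list[UsageRecord]) -> list[dict]:
    """Aggregate records into per-day rows: date, requests, tokens, cost_usd."""
    totals = defaultdict(lambda: {"requests": 0, "tokens": 0, "cost_usd": 0.0})
    for record in records:
        row = totals[record.timestamp.date().isoformat()]
        row["requests"] += 1
        row["tokens"] += record.tokens
        row["cost_usd"] += record.cost_usd
    # ISO dates sort chronologically, so a plain sort orders the rows by day.
    return [{"date": day, **totals[day]} for day in sorted(totals)]
```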

⚙️ Optimization Features

Context Cache Integration

When Context Cache is enabled in API Settings, you’ll see significant reductions in token consumption. The cache stores system instructions and frequently used context for 24 hours, reducing:

Repeat Token Usage: System instructions don’t need to be sent with every request
Context Efficiency: Conversation context is cached and reused
Cost Savings: Up to 60% reduction in token consumption for active sites

Session Management Benefits

With Session Management enabled (Beta feature), you can achieve even greater savings:

Short Conversations (5-10 messages): ~30% token reduction
Medium Conversations (10-30 messages): ~70% token reduction
Long Conversations (30+ messages): ~90% token reduction
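
The tiers above can be turned into a quick estimate of how many tokens a conversation would use with Session Management enabled. This is a sketch only: the percentages are the approximate figures quoted above, conversations under 5 messages are assumed to see no reduction, and the boundaries at 10 and 30 messages are resolved here by placing 10 in the short tier and 30 in the long tier.

```python
def session_savings_rate(message_count: int) -> float:
    """Return the approximate token reduction for a conversation length."""
    if message_count < 5:
        return 0.0   # Assumption: no figure is quoted for very short conversations.
    if message_count <= 10:
        return 0.30  # Short conversations
    if message_count < 30:
        return 0.70  # Medium conversations
    return 0.90      # Long conversations


def tokens_after_savings(baseline_tokens: int, message_count: int) -> int:
    """Estimate token usage after applying the tier's reduction."""
    return round(baseline_tokens * (1 - session_savings_rate(message_count)))
```

For example, a 20-message conversation that would otherwise use 10,000 tokens comes out at roughly 3,000 under these assumptions.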

🔄 Data Management

Refresh Functionality

Use the “Refresh Data” button to:

Update Statistics: Get the latest usage information
Sync Records: Ensure accuracy between local tracking and actual usage
Real-Time Monitoring: Check current consumption levels

Data Retention

Usage data is stored locally in your WordPress database and automatically:

Maintains 30-Day History: Keeps detailed records for analysis
Auto-Cleanup: Removes older data to prevent database bloat
Privacy Protection: All data stays on your server
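
The 30-day cleanup amounts to dropping any daily row older than a cutoff date. The sketch below shows that rule on an in-memory dictionary. The plugin itself stores data in the WordPress database, so the data structure and function name here are illustrative assumptions.

```python
from datetime import date, timedelta

RETENTION_DAYS = 30  # Matches the 30-day history described above.


def prune_old_rows(rows: dict[date, int], today: date,
                   retention_days: int = RETENTION_DAYS) -> dict[date, int]:
    """Keep only rows from the last `retention_days` days, counting today."""
    cutoff = today - timedelta(days=retention_days)
    return {day: count for day, count in rows.items() if day > cutoff}
```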

🚨 Usage Alerts and Limits

Understanding Google’s Limits

Google’s free tier provides the following (exact limits depend on the Gemini model and can change):

15 requests per minute: Rate limiting for API calls
1,500 requests per day: Daily quota for free accounts
Generous Token Allowance: Sufficient for most small to medium websites
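
To see how the two request limits interact, the sketch below tracks request timestamps and refuses a request once either limit would be exceeded. It is a hypothetical client-side guard, not part of the plugin. It also assumes rolling 60-second and 24-hour windows, whereas Google may reset daily quotas at a fixed time.

```python
from collections import deque


class QuotaTracker:
    """Track request times against per-minute and per-day limits (sketch)."""

    def __init__(self, per_minute: int = 15, per_day: int = 1500) -> None:
        self.per_minute = per_minute
        self.per_day = per_day
        self._timestamps: deque[float] = deque()

    def allow(self, now: float) -> bool:
        """Record a request at time `now` (seconds) if both limits permit it."""
        # Forget requests that are 24 hours old or older.
        while self._timestamps and now - self._timestamps[0] >= 86_400:
            self._timestamps.popleft()
        if len(self._timestamps) >= self.per_day:
            return False
        in_last_minute = sum(1 for t in self._timestamps if now - t < 60)
        if in_last_minute >= self.per_minute:
            return False
        self._timestamps.append(now)
        return True
```

In real use you would pass time.time() as `now`. Taking the time as a parameter keeps the class easy to test.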

Monitoring Recommendations

Daily Checks: Review usage patterns regularly to identify trends

Weekly Analysis: Look for unusual spikes or decreases in activity

Monthly Planning: Use historical data to project future needs

Alert Setup: Consider external monitoring if you approach quota limits

💰 Cost Optimization Strategies

Enable Context Caching

Activate Context Cache in API Settings to reduce token consumption by up to 60% for sites with consistent system instructions and page context.

Optimize System Instructions

Keep your system instructions concise but effective. Longer instructions consume more tokens with each request.

Configure Session Management

Enable the beta Session Management feature for dramatic token savings on sites with longer conversations.

Review Assistant Configuration

Regularly assess your chatbot’s settings to ensure optimal performance without unnecessary token consumption.

🔍 Troubleshooting Usage Issues

High Usage Concerns

If you notice unexpectedly high usage:

Check for Bot Traffic: Automated requests can inflate usage
Review System Instructions: Long instructions increase token consumption
Analyze User Patterns: Look for unusual conversation lengths
Verify Rate Limiting: Ensure proper request throttling

Low Usage Analysis

Unexpectedly low usage might indicate:

Configuration Issues: Check if the chatbot is properly displayed
User Experience Problems: Verify chatbot accessibility and positioning
Content Relevance: Ensure your assistant provides value to visitors

🚀 What’s Next

Understanding your API usage ensures cost-effective operation while maximizing value. Continue to Session Management to learn about advanced token optimization features, or explore Page Context to understand how content integration affects consumption.