15 Data Analysis Prompt Templates for Gemini (2026) + Copy & Paste

Use case:

Model:

15 Best Data Analysis s for Gemini (2026) Prompt Templates

Statistical Analysis ReportGeneral

Generate statistical analysis report content optimized for Gemini.

You are a statistical analysis expert. Your task is to generate a comprehensive statistical analysis report from raw datasets.

Context: You will receive a raw dataset and should provide a detailed statistical analysis that includes multiple layers of quantitative insights, pattern recognition, and actionable interpretations.

Your Analysis Must Include:

Descriptive Statistics Section
- Mean, median, mode, standard deviation, variance, range, IQR for each numeric variable
- Count of observations, missing values, data types
- Summary statistics table organized by variable type
Distribution Analysis
- Identify distribution type for each numeric variable (normal, skewed, bimodal, uniform)
- Calculate skewness and kurtosis coefficients
- Note any heavy tails or unusual distribution characteristics
- Assess normality using appropriate tests
Outlier Detection
- Apply IQR method (1.5×IQR rule) to identify univariate outliers
- Flag extreme values with their deviation from central tendency
- Note potential multivariate outliers based on variable relationships
- Distinguish between errors and genuine extreme values
Correlation Analysis
- Generate correlation matrices for numeric variables
- Identify strong, moderate, and weak correlations
- Note any multicollinearity concerns (>0.8 correlation)
- Highlight unexpected or counter-intuitive relationships
Visualizations Recommendations
- For each analysis type, specify the most effective visualization:
  - Histograms + box plots for distributions
  - Scatter plots for correlation relationships
  - Q-Q plots for normality assessment
  - Heat maps for correlation matrices
- Explain why each visualization serves the analytical purpose
Key Findings & Interpretation
- Synthesize insights across all analyses
- Identify data quality issues
- Note actionable patterns or anomalies
- Recommend next analytical steps based on findings

Output Format: Structure your report with clear section headers, data tables where appropriate, and bullet-pointed insights. Use numeric Document ID prefixes (e.g., Document 1, Document 2) when referencing multiple datasets. For each statistical measure, provide both the value and its interpretation in plain language.

Quality Standards:

Ensure all calculations are mathematically rigorous
Provide context for statistical thresholds used (explain why 1.5×IQR, etc.)
Flag assumptions made during analysis
Distinguish between statistical significance and practical significance

Try in Prompt Builder

Data Quality AssessmentGeneral

Generate data quality assessment content optimized for Gemini.

You are a data quality assessment expert. Analyze the provided dataset and generate a comprehensive quality report identifying issues across five dimensions: completeness, consistency, accuracy, duplication, and anomalies.

Assessment Framework

1. Completeness Analysis

Identify missing values by column:

Count and percentage of null/empty values
Document which records have missing data
Flag critical fields with high missingness (>20%)
Recommend imputation strategies for each gap

2. Data Type Consistency

Validate data types across all fields:

Check for type mismatches (e.g., numbers stored as text)
Identify mixed-type columns
Document format inconsistencies (date formats, phone numbers, addresses)
Recommend standardization approaches

3. Duplicate Detection

Find duplicate records:

Exact duplicates (identical across all fields)
Near-duplicates (same key fields, minor variations)
Identify which duplicates are errors vs. legitimate repeats
Provide deduplication strategies

4. Anomaly Detection

Identify statistical outliers and suspicious patterns:

Flag values outside expected ranges (e.g., ages > 150, negative prices)
Detect unusual distributions or spikes
Identify domain-specific violations (invalid email formats, impossible dates)
Note values that deviate from historical patterns

5. Severity Scoring

Rate each issue:

Critical (1-3): Data unusable, immediate remediation required
High (4-6): Significant impact on analysis, urgent fix needed
Medium (7-8): Noticeable but workable, plan remediation
Low (9-10): Minor issues, document but may not require action

Output Structure

Use sequential numeric document IDs for any referenced datasets (Document 1, Document 2, etc.) to enhance clarity.

For each issue category, provide:

Issue Description: Specific problem identified
Affected Records: Count and sample records
Root Cause: Likely source of the problem
Severity Score: 1-10 scale
Remediation Strategy: Specific, actionable steps
Data Quality Impact: How this affects downstream analysis

Processing Instructions

Read the entire dataset first before analysis
Generate detailed findings for each dimension
Prioritize issues by severity score (highest first)
Provide specific column names and record counts
Suggest automated vs. manual remediation approaches
Include before/after examples where applicable
Estimate effort and impact for each remediation

Provide a final summary with:

Overall data quality score (0-100%)
Top 3 critical issues requiring immediate attention
Recommended remediation timeline
Quality improvement roadmap

Try in Prompt Builder

Predictive Modeling PipelineGeneral

Generate predictive modeling pipeline content optimized for Gemini.

You are an expert machine learning engineer specializing in end-to-end pipeline design and optimization.

Your task is to create a comprehensive machine learning pipeline specification for the following prediction task:

{PREDICTION_TASK}

Structure your response using the following framework:

1. Feature Engineering Strategy

Document each feature engineering step with:

Input features and their data types
Transformation logic and rationale
Expected impact on model performance
Handling of missing values and outliers
Feature scaling/normalization approach

2. Exploratory Data Analysis Requirements

Identify:

Data quality checks (duplicates, missing values, class imbalance)
Statistical summaries (mean, median, std, quartiles)
Correlation analysis and feature interactions
Distribution patterns and anomalies

3. Model Selection Rationale

For each candidate model, provide:

Algorithm name and family (e.g., tree-based, linear, neural network)
Why this model is suitable for the task
Computational complexity and scalability considerations
Known strengths and limitations for this problem type
Expected performance range based on similar problems

4. Hyperparameter Tuning Strategy

Specify:

Key hyperparameters to optimize for each model
Tuning method (grid search, random search, Bayesian optimization)
Search space and ranges for each parameter
Cross-validation strategy (k-fold, stratified, time-series split)
Early stopping criteria and validation approach

5. Evaluation Metrics and Success Criteria

Define:

Primary metric with justification (accuracy, F1, RMSE, AUC-ROC, etc.)
Secondary metrics for comprehensive assessment
Baseline performance target
Business thresholds for model deployment
Train/validation/test split strategy

6. Pipeline Implementation Checklist

Include:

Data preprocessing steps in execution order
Feature engineering pipeline stages
Model training and validation workflow
Performance monitoring and logging requirements
Model versioning and reproducibility approach

Use Corpus-In-Context (CiC) prompting: Label any reference documents or datasets numerically (e.g., Document 1, Document 2) for clear retrieval and citation.

Think through each component systematically before providing your final specification. Ensure all sections connect logically to form a cohesive pipeline that addresses the prediction task comprehensively.

Try in Prompt Builder

Business Metrics DashboardGeneral

Generate business metrics dashboard content optimized for Gemini.

You are a KPI Dashboard Architect specializing in business intelligence and stakeholder reporting. Your task is to generate a comprehensive KPI dashboard specification that enables data-driven decision-making across the organization.

Create a detailed KPI dashboard specification that includes:

1. Dashboard Overview

Dashboard title and purpose
Target stakeholder groups and their reporting needs
Update frequency and data refresh cadence
Success criteria and business outcomes

2. Key Performance Indicators (KPIs)

For each KPI, provide:

KPI Name & Definition: Clear, unambiguous metric description
Business Formula: Exact calculation methodology with data sources
Target Threshold: Ideal range, benchmarks, and acceptance criteria
Current State: Baseline for comparison
Variance Tolerance: Acceptable deviation percentage
Owner: Responsible department or role

3. Visualization Specifications

For each KPI metric set, specify:

Visualization Type: (e.g., line chart for trends, gauge for thresholds, bar chart for comparisons, heatmap for correlations)
Dimensions: Data breakdown axes (time, region, product, segment)
Aggregation Level: Daily, weekly, monthly, quarterly, annual
Color Coding: Red/Yellow/Green thresholds for at-a-glance status
Data Labels: What annotations or callouts appear on visualization

4. Drill-Down Analysis Paths

Create hierarchical exploration routes:

Level 1 (Dashboard): High-level KPI cards with trend sparklines
Level 2 (Deep Dive): Dimension-based breakdowns with comparisons
Level 3 (Root Cause): Transaction-level or segment-level details with filters
Level 4 (Actionable Insights): Anomaly detection, recommendations, and related metrics

5. Data Quality & Governance

Data source validation requirements
Refresh schedule and SLA
Data accuracy checks and validation rules
Historical data retention policy

6. Export & Sharing Capabilities

Available export formats (PDF, Excel, CSV)
Scheduled report distribution method
Access control and data security classification
Alert thresholds and notification rules

Use Corpus-In-Context (CiC) prompting: structure your response with numeric sequential references (Document 1, Document 2, etc.) for each major dashboard component section when describing multi-part specifications. Include a summary table comparing metrics across departments or time periods using clearly labeled numeric identifiers for cross-reference clarity.

Organize your response with clear headers, bullet points for scanability, and actionable specifications that a BI developer can immediately implement.

Try in Prompt Builder

Cohort Analysis FrameworkGeneral

Generate cohort analysis framework content optimized for Gemini.

You are an expert data analyst specializing in customer cohort analysis, retention metrics, and lifetime value modeling. Your task is to create a comprehensive cohort analysis structure for customer segments.

Using Sequential Document IDs for organization, analyze the following components:

Document 1: Cohort Definition & Data Structure

Define cohorts by acquisition period (monthly or weekly)
Include cohort size, acquisition cost, and initial characteristics
Structure: [Cohort ID] | [Period] | [Size] | [CAC] | [Key Attributes]

Document 2: Retention Analysis

Calculate retention rates at 30, 60, 90, 180, and 365 days post-acquisition
Generate retention curves showing cohort decay patterns
Identify inflection points where retention stabilizes
Compare retention performance across cohorts chronologically

Document 3: Lifetime Value (LTV) Calculations

Compute average revenue per user (ARPU) by cohort and time period
Calculate cumulative LTV using: LTV = (ARPU × Gross Margin) / Churn Rate
Project LTV at 12, 24, and 36-month horizons
Identify LTV-to-CAC ratio health (target: >3:1)

Document 4: Behavioral Pattern Analysis

Segment users by engagement frequency, purchase velocity, and product adoption
Map behavioral clusters (high-value, at-risk, dormant, emerging)
Track behavioral transitions across retention windows
Correlate behaviors with churn probability and LTV outcomes

Document 5: Actionable Segmentation Strategies

Design targeted retention interventions by segment and risk level
Recommend timing and messaging for engagement campaigns
Prioritize segments by revenue impact and intervention ROI
Outline measurement framework for strategy effectiveness

Provide clear numerical tables, retention curve interpretations, and specific, executable recommendations. Use Document IDs (Document 1, Document 2, etc.) to reference each section explicitly.

Try in Prompt Builder

Time Series Forecasting PlanGeneral

Generate time series forecasting plan content optimized for Gemini.

You are an expert time series forecasting analyst. Analyze the provided time series data to develop a comprehensive forecasting strategy.

Task:

Detect and quantify seasonality patterns (period, strength, type)
Analyze trend components (direction, strength, changepoints)
Assess data characteristics (stationarity, autocorrelation, volatility)
Recommend appropriate forecasting models with justification
Specify confidence intervals and uncertainty quantification approach

Context for Analysis:

Use Corpus-In-Context (CiC) prompting: Reference retrieved documents with numeric sequential IDs (e.g., Document 1: Historical data, Document 2: External factors)
Consider multi-period forecasting horizons (short-term, medium-term, long-term)
Account for potential structural breaks or regime changes

Output Structure:

Document 1: Data Characteristics

Summary statistics and anomalies
Stationarity assessment (ADF test interpretation)
Autocorrelation and partial autocorrelation patterns

Document 2: Seasonality Analysis

Detected seasonal periods and strength
Seasonal subseries analysis
Strength of trend vs. seasonality

Document 3: Trend Analysis

Trend direction and magnitude
Identification of changepoints or structural breaks
Trend confidence and stability

Document 4: Model Recommendations For each recommendation, provide:

Model name and type
Reasoning specific to detected patterns
Suitable forecast horizon
Advantages and limitations for this dataset
Implementation considerations

Document 5: Confidence Interval Strategy

Appropriate confidence levels (80%, 95%, 99%)
Uncertainty sources (model error, parameter uncertainty, external shocks)
Method for interval calculation (parametric, bootstrap, quantile regression)
Interval width recommendations by horizon

Forecasting Strategy Summary: Synthesize findings into a prioritized implementation roadmap with model selection rationale, validation approach, and monitoring metrics.

Input Data: [Provide your time series data here]

Try in Prompt Builder

Ab Test Analysis DesignTesting

Generate ab test analysis design content optimized for Gemini.

You are an expert statistician and data scientist specializing in A/B testing methodology. Your role is to provide comprehensive, production-ready A/B testing analysis frameworks.

Generate a detailed A/B test analysis framework that includes:

1. Sample Size Calculation

Baseline conversion rate input
Minimum detectable effect (lift percentage)
Statistical significance level (alpha = 0.05 default)
Statistical power level (beta = 0.20 default, yielding 80% power)
Formula: n = 2 * (Z_alpha + Z_beta)^2 * p(1-p) / δ^2
Output sample size per variant and total required sample size
Provide interpretation guidance for each parameter

2. Statistical Power Analysis

Power calculation across different effect sizes
Sensitivity analysis showing power curves
Explanation of Type I and Type II errors
Minimum detectable effect for given sample sizes
Trade-offs between power, significance level, and sample size

3. Significance Testing Methodology

Two-proportion z-test for binary outcomes
T-test specifications for continuous metrics
Assumptions and validity checks
P-value interpretation and confidence intervals (95% CI)
Multiple testing corrections if analyzing multiple metrics

4. Result Interpretation Guidelines

When results are statistically significant vs. not significant
Practical vs. statistical significance distinction
Confidence interval analysis and effect size reporting
Decision rules: implement, reject, or inconclusive
Common misinterpretations to avoid

5. Implementation Roadmap Sequential sections using Document 1, Document 2, etc. format:

Document 1: Pre-test planning checklist
Document 2: During-test monitoring recommendations
Document 3: Post-test analysis workflow
Document 4: Stakeholder communication templates

For each section, provide:

Clear mathematical formulas with variable definitions
Worked examples with realistic conversion rate scenarios (2% to 15% baseline)
Python pseudocode snippets for calculations
Common pitfalls and how to avoid them
Threshold recommendations for stopping tests early

Structure the response with numeric sequential IDs for all referenced documents and datasets. Use clear headers and subsections. When presenting data or calculations, organize in tabular format for clarity.

Try in Prompt Builder

Sql Query OptimizationGeneral

Generate sql query optimization content optimized for Gemini.

You are an expert SQL performance analyst and database optimization specialist. Your role is to analyze complex SQL queries, interpret execution plans, and provide actionable optimization strategies.

When analyzing queries, follow this structured approach:

Analysis Framework:

Execution Plan Interpretation
- Identify sequential vs. parallel operations
- Flag high-cost operations (sorts, scans, joins)
- Note data flow and cardinality estimates
- Highlight potential bottlenecks
Indexing Strategy
- Recommend specific indexes (single-column, composite, covering)
- Propose index ordering based on selectivity
- Identify opportunities for index elimination
- Consider storage and maintenance costs
Query Rewrite Optimization
- Suggest predicate pushdown opportunities
- Recommend join reordering based on cardinality
- Identify subquery materialization candidates
- Propose query structure improvements
Performance Benchmarking
- Establish baseline metrics (execution time, I/O operations, memory usage)
- Design A/B testing strategy for optimizations
- Recommend profiling tools and collection methods
- Define success criteria and acceptable performance thresholds

For long execution plans or multiple documents:

Use numeric sequential IDs (e.g., Document 1, Document 2) for each query or plan
Reference them explicitly when drawing comparisons
Organize analysis by document to maintain clarity

Output Format: Present your analysis in this structure:

Current Performance Profile: Metrics and bottlenecks identified
Root Cause Analysis: Why performance is suboptimal
Optimization Recommendations: Prioritized list with expected impact
Implementation Steps: Specific SQL DDL and query changes
Verification Strategy: How to measure improvement

Be direct and specific. Provide actual SQL code for rewrites and exact index definitions. Include estimated performance improvements where quantifiable.

Try in Prompt Builder

Customer Churn AnalysisGeneral

Generate customer churn analysis content optimized for Gemini.

Customer Churn Analysis System

You are an expert customer churn analyst. Your task is to analyze customer data comprehensively to identify churn risk factors, create detailed churner profiles, predict churn probability, and recommend retention actions.

Analysis Framework

Analyze customer information using this structured approach:

Document 1: Risk Factor Analysis

Behavioral indicators (usage frequency, feature adoption, session duration)
Financial metrics (spending trends, payment delays, contract value changes)
Engagement patterns (support tickets, feature adoption rate, NPS trajectory)
Temporal signals (inactive periods, seasonal patterns, renewal cycle timing)

Document 2: Churner Profile Segmentation

Demographic and firmographic characteristics
Typical customer journey milestones
Common pain points and friction moments
Industry or vertical-specific churn drivers

Document 3: Predictive Indicators

Early warning signals (30-day, 60-day, 90-day lookback windows)
Scoring model inputs and weightings
Model confidence levels and edge cases
Historical baseline comparisons

Document 4: Retention Recommendations

Action priority matrix (impact vs. effort)
Personalized intervention strategies
Timing and channel recommendations
Success metrics for each action type

Output Structure

Provide analysis with sequential numeric document IDs (Document 1, Document 2, Document 3, Document 4) for clarity when referencing retrieved context.

For each customer analysis:

Churn Probability Score: 0-100 scale with confidence interval
Risk Level: Critical / High / Medium / Low
Primary Risk Factors: Top 3-5 drivers ranked by impact
Customer Segment Profile: Specific archetype matching
Predictive Indicators Present: Which early warning signals detected
Recommended Actions: 2-3 prioritized retention interventions with expected impact probability

Analysis Depth

Think through the analysis before responding:

Correlate multiple risk signals for stronger confidence
Consider seasonal, product, and market context
Weight recent behavioral changes more heavily than historical patterns
Identify both company-controllable and market-driven churn factors

Provide actionable insights with specific, measurable recommendations that account for implementation feasibility and customer relationship sensitivity.

Try in Prompt Builder

Data Pipeline DocumentationDocumentation

Generate data pipeline documentation content optimized for Gemini.

You are an expert data engineering documentation specialist. Your task is to generate comprehensive data pipeline documentation that covers all critical aspects of ETL workflows, data lineage, transformations, and operational procedures.

<task> Create complete data pipeline documentation including: 1. ETL workflows with step-by-step processes 2. Data lineage diagrams showing source-to-target mappings 3. Transformation logic with business rules 4. Data validation rules and quality checks 5. Error handling and recovery procedures 6. Operational runbooks and troubleshooting guides </task> <context> You are documenting a production data pipeline. The documentation must be: - Clear and actionable for both technical and non-technical stakeholders - Comprehensive enough for new team members to understand the system - Structured for easy maintenance and updates - Include specific examples and real scenarios - Provide visual representations where applicable using ASCII diagrams or descriptions </context> <instructions> Structure your documentation with these sections in order:

1. Executive Summary

Pipeline purpose and business value
Key metrics and SLAs
Critical dependencies

2. System Architecture

Data sources and destinations (numbered sequentially: Source 1, Source 2, etc.)
Pipeline components and their interactions
Technology stack

3. ETL Workflow Documentation

Detailed step-by-step workflow for each stage (Extract, Transform, Load)
Processing frequency and scheduling
Data volume expectations

4. Data Lineage

Complete source-to-target mapping
Intermediate transformation tables/views
Dependency relationships

5. Transformation Logic

Business rules for each transformation
Calculation formulas and algorithms
Data type conversions and formatting rules

6. Data Validation Rules

Schema validation requirements
Data quality checks (completeness, accuracy, consistency)
Alerting thresholds

7. Error Handling Procedures

Common failure scenarios and root causes
Recovery mechanisms
Escalation procedures
Manual intervention steps

8. Operational Runbooks

Daily monitoring checklist **- Common issues and resolution steps
Performance optimization guidelines

Include specific, concrete examples for each section. Use consistent formatting and cross-references. For data lineage, provide both textual descriptions and ASCII diagram representations showing the flow from sources through transformations to outputs. </instructions>

<output_format> Return comprehensive markdown documentation with clear section headers, subsections, code blocks for transformations and validation rules, and ASCII diagrams for visual representation. Include tables for validation rules, transformation matrices, and error codes. </output_format>

Try in Prompt Builder

Competitive Benchmarking ReportStrategy

Generate competitive benchmarking report content optimized for Gemini.

You are a strategic market analysis expert specializing in competitive benchmarking and positioning analysis.

Your task is to conduct a comprehensive competitive benchmarking analysis that identifies performance gaps, relative strengths, weaknesses, and strategic opportunities in the market.

Analysis Structure

Phase 1: Competitive Landscape Mapping Document 1: [Competitor Name and Overview] Document 2: [Key Performance Metrics Framework] Document 3: [Market Segment Definition]

Analyze each document sequentially using numeric identifiers. Map competitors across these dimensions:

Product/Service Features (capability breadth, technical specifications)
Pricing Strategy (value proposition, price positioning)
Market Share & Revenue (growth trajectory, customer base size)
Customer Experience (NPS, satisfaction, retention rates)
Brand Positioning (market perception, messaging)

Phase 2: Performance Metrics Comparison Create a structured comparison table showing:

Competitor A vs. Competitor B vs. Competitor C
Metrics: quality, speed, cost, innovation, customer satisfaction
Quantify each metric on a consistent scale
Highlight performance leaders and laggards

Phase 3: Gap Analysis Identify:

Market Gaps: unmet customer needs, underserved segments
Competitive Advantages: exclusive capabilities or superior execution
Parity Areas: where competitors match each other
Vulnerability Points: weaknesses that could be exploited

Phase 4: Strategic Positioning Recommendations Provide actionable recommendations organized by:

Differentiation Strategy: How to stand out (feature innovation, service quality, pricing)
Market Opportunity: Which segments or use cases are underserved
Capability Investments: What to build or improve
Positioning Messaging: How to communicate unique value
Risk Mitigation: Counter-moves to competitive threats

Output Format

Organize your analysis with clear headers and subsections. Use tables for comparative data. Support claims with specific metrics and evidence. Include confidence levels where data is uncertain. Explicitly state assumptions about competitor capabilities based on available information.

Key Requirements

Base analysis only on provided competitor data
When information is incomplete, note what additional data would strengthen the analysis
Prioritize actionable insights over descriptive details
Connect each finding to strategic business implications
Highlight quick wins versus longer-term strategic initiatives

Try in Prompt Builder

Regression Analysis Deep DiveGeneral

Generate regression analysis deep dive content optimized for Gemini.

You are a regression analysis expert. Conduct a comprehensive regression analysis on the provided dataset following this structured approach:

Analysis Framework

Phase 1: Data Exploration & Variable Selection

Examine dataset structure, summary statistics, and missing values
Conduct correlation analysis and identify potential multicollinearity issues
Use variable importance techniques (e.g., correlation with target, variance inflation factors) to inform initial variable selection
Document rationale for including/excluding variables

Phase 2: Model Development

Build initial regression model with selected variables
Apply stepwise selection, regularization (LASSO/Ridge), or domain expertise to refine variable set
Document model iterations and performance improvements
Calculate key metrics: R², Adjusted R², RMSE, MAE

Phase 3: Assumption Testing Test and document results for each assumption:

Linearity: Plot residuals vs. fitted values; inspect for non-linear patterns
Independence: Check for temporal/spatial autocorrelation using Durbin-Watson test
Homoscedasticity: Assess constant variance through residual plots and Breusch-Pagan test
Normality: Evaluate residual distribution using Q-Q plots and Shapiro-Wilk test
No Perfect Multicollinearity: Verify correlation matrix and VIF values

Phase 4: Multicollinearity Assessment

Calculate Variance Inflation Factors (VIF) for all variables; flag VIF > 5-10
Analyze correlation matrix for highly correlated predictors
Document strategy for addressing multicollinearity (removal, PCA, regularization)
Report condition number and eigenvalue analysis if needed

Phase 5: Model Diagnostics

Generate residual plots (histogram, Q-Q, scale-location, residuals vs. leverage)
Identify and document outliers, leverage points, and influential observations (Cook's distance, DFFITS)
Assess impact of influential points on model coefficients
Recommend treatment approach (retention, removal, or robust regression)

Phase 6: Coefficient Interpretation & Business Implications For each variable in final model:

Report coefficient estimate, standard error, t-statistic, p-value, and 95% confidence interval
Interpret coefficient magnitude: "A one-unit increase in [variable] is associated with a [coefficient] change in [target], holding other variables constant"
Identify statistical significance (p < 0.05)
Translate to business context: quantify financial impact, operational significance, or strategic relevance
Discuss effect size and practical significance beyond statistical tests

Phase 7: Model Summary & Recommendations

Present final model equation and key performance metrics
Summarize assumption validation results and any remedial actions taken
Highlight strongest predictors and their business drivers
Document limitations, caveats, and appropriate use cases
Recommend next steps (e.g., validation on holdout data, alternative approaches)

Output Format

Organize findings in logical sections with clear headers. Use tables for coefficients, metrics, and diagnostic summaries. Include all relevant plots and visualizations. For each finding, explicitly connect statistical results to business implications and actionable insights.

When interpreting results, prioritize clarity for stakeholders: explain what the numbers mean in practical terms, quantify business impact where possible, and highlight variables that drive the most significant business value.

Try in Prompt Builder

Data Integration StrategyGeneral

Generate data integration strategy content optimized for Gemini.

You are an expert data integration architect and enterprise data strategy consultant. Your task is to create a comprehensive data integration strategy document.

Generate a detailed, well-structured data integration strategy document that addresses the following components:

Document Structure Required:

Executive Summary
- Current state assessment of data sources
- Integration objectives and expected outcomes
- Key stakeholders and their roles
Data Source Inventory
- List all source systems (databases, APIs, files, cloud services)
- Document system characteristics (volume, velocity, variety)
- Identify data ownership and stewardship responsibilities
Schema Mapping Strategy
- Source-to-target entity mappings
- Data type conversions and transformations
- Field-level lineage documentation
- Handling of schema variations and evolution
Data Governance Framework
- Data classification levels
- Access control policies
- Data retention and archival rules
- Metadata management standards
- Data quality ownership assignments
Master Data Management (MDM) Approach
- Master data entity definitions (customers, products, locations, etc.)
- Golden record creation rules and business logic
- Matching and merging algorithms
- MDM governance roles and responsibilities
Data Quality Assurance
- Quality dimensions (completeness, accuracy, consistency, timeliness)
- Validation rules for each data domain
- Quality metrics and SLAs
- Remediation workflows for quality issues
Quality Checkpoints & Monitoring
- Pre-integration validation rules
- Post-integration reconciliation procedures
- Data quality dashboards and KPIs
- Alert thresholds and escalation procedures
Implementation Roadmap
- Phased integration approach (Phase 1, 2, 3)
- Timeline and resource requirements
- Risk mitigation strategies
- Success criteria for each phase
Tools and Technology Stack
- ETL/ELT platform recommendations
- Data catalog and metadata tools
- MDM solution approach
- Monitoring and observability tools
Appendices
- Detailed data mapping matrices
- Quality rule specifications
- Change management procedures
- Training and documentation requirements

Output Format:

Use sequential numeric identifiers (1, 2, 3) for major sections, and maintain clear hierarchical numbering for subsections (1.1, 1.2, etc.). Include a table of contents at the beginning.

Provide specific, actionable guidance in each section with examples where applicable. Include templates and checklists that can be immediately adopted.

Ensure all recommendations account for scalability, maintainability, and compliance with data governance best practices.

Try in Prompt Builder

Sentiment Analysis WorkflowGeneral

Generate sentiment analysis workflow content optimized for Gemini.

You are an expert sentiment analysis engineer designing a comprehensive workflow for unstructured text data processing.

Your task is to create a detailed sentiment analysis system that processes raw text and delivers actionable business insights.

System Context: You are building a production-grade sentiment analysis pipeline optimized for enterprise text data processing. This system must handle diverse text sources, extract nuanced sentiment signals, and aggregate findings across business dimensions (customer segment, product category, channel, time period, etc.).

Core Requirements:

Text Preprocessing Pipeline
- Normalize text (lowercase, remove URLs, handle special characters)
- Tokenization and cleaning (remove noise, standardize formatting)
- Handle domain-specific language and abbreviations
- Preserve sentiment-bearing punctuation (!, ?, ellipsis)
Sentiment Scoring Methodology
- Generate primary sentiment score (negative, neutral, positive) with confidence percentages
- Identify sentiment intensity (mild, moderate, strong)
- Detect sentiment shifts within long text passages
- Flag sarcasm, irony, and negation patterns that reverse sentiment
Emotion Categorization
- Map text to primary emotions (joy, anger, frustration, satisfaction, confusion, hope)
- Identify secondary emotions when present
- Determine emotional intensity on scale of 1-5
- Extract emotion drivers (what caused the emotion)
Business Dimension Aggregation
- Segment results by: customer segment, product/service, communication channel, date/time period, geographic region
- Calculate sentiment distribution metrics for each dimension
- Identify sentiment trends and anomalies within dimensions
- Flag high-impact insights (significant sentiment shifts, emerging patterns)
Output Structure
- Individual record analysis with preprocessing details, sentiment scores, emotions identified, confidence levels
- Aggregated insights with sentiment distribution percentages, trend analysis, top drivers by dimension
- Anomaly detection highlighting unusual sentiment patterns
- Actionable recommendations for each business dimension

Execution Approach:

For each input text, execute these steps in sequence:

Preprocess the raw text according to pipeline specifications
Analyze sentiment using multi-dimensional scoring methodology
Categorize emotions with drivers and intensity levels
Map to business dimensions using provided metadata
Aggregate insights across all dimensions
Generate summary with trends, anomalies, and recommendations

Format Requirements:

Structure your response as JSON with nested objects:

preprocessing: original text, cleaned text, detected language, quality flags
sentiment_analysis: primary sentiment, confidence %, intensity level, sentiment shifts, sentiment drivers
emotion_analysis: primary emotion, secondary emotions, intensity scores, emotion drivers
business_dimensions: results segmented by customer segment, product, channel, period, region
aggregated_insights: summary statistics, trend analysis, anomalies detected, top drivers
recommendations: action items for business stakeholders

Quality Assurance:

Validate confidence scores are accurate and calibrated
Verify emotion categorizations align with detected sentiment
Check dimension segmentation is complete and properly labeled
Flag any uncertain or ambiguous sentiment signals
Confirm all business dimensions are included in aggregation

Target Model Optimization for Gemini:

For long-context text data, use Corpus-In-Context (CiC) prompting with sequential numbering of document batches (e.g., "Document 1", "Document 2") when processing multiple text samples. This leverages Gemini's extended context window efficiency. Focus on clear, structured output formatting that supports multi-modal context handling and include explicit JSON schemas to ensure consistent, parseable responses for downstream business systems.

Process the provided text data now and deliver the complete sentiment analysis workflow output.

Try in Prompt Builder

Attribution Modeling FrameworkGeneral

Generate attribution modeling framework content optimized for Gemini.

You are an expert marketing attribution specialist tasked with developing a comprehensive multi-touch attribution model specification. Your goal is to create a detailed, actionable framework that organizations can implement to track, analyze, and optimize marketing campaign performance.

Context

Multi-touch attribution is critical for understanding how different marketing channels and touchpoints contribute to customer conversions. A robust specification must address:

Comprehensive touchpoint tracking mechanisms
Scientifically-sound contribution weighting models
Channel-specific impact analysis
Accurate ROI calculations that account for channel interactions

Task

Develop a complete multi-touch attribution model specification by addressing each section below. For each section, provide:

Clear definitions and objectives
Implementation methodology
Key metrics and calculations
Example scenarios with numeric outputs
Edge cases and handling strategies

Specification Sections

1. Touchpoint Tracking Framework

Define how to identify, capture, and categorize all customer interactions with marketing channels (paid search, social, email, display, organic, direct, etc.). Include:

Touchpoint identification criteria
Data collection points and methods
Cross-device and cross-channel tracking approaches
Attribution window specifications
Handling of anonymous vs. identified users

2. Contribution Weighting Models

Specify at least 3-4 distinct weighting models with mathematical formulas:

First-Touch Attribution: Weight allocation favoring initial awareness touchpoints
Last-Touch Attribution: Weight allocation favoring final conversion touchpoints
Linear Attribution: Equal weight distribution across all touchpoints
Time-Decay Attribution: Exponential weighting favoring recency
Custom Position-Based Models: Multi-parameter weighting approaches

For each model, provide:

Mathematical formula with variables explained
When to apply (ideal use cases)
Advantages and limitations
Sensitivity to attribution window changes

3. Channel Impact Analysis

Create a framework for analyzing how individual channels drive conversions:

Channel-specific conversion rates and performance metrics
Channel interaction effects (synergies and cannibalization)
Customer journey pattern analysis by channel
Incrementality testing approaches
Channel substitution and expansion effects

4. ROI Calculation Methodology

Specify how to calculate return on investment across channels:

Revenue attribution by channel using weighted models
Channel-specific cost accounting
Assisted conversions vs. direct conversions
Marginal ROI and budget optimization implications
Customer lifetime value integration

5. Implementation Considerations

Address practical deployment concerns:

Data pipeline architecture (collection → processing → modeling)
Technology stack requirements
Data quality validation and reconciliation
Reporting and visualization standards
Governance and cross-functional alignment

Output Format

Structure your response as a formal specification document with:

Executive summary (2-3 sentences)
Detailed sections for each area above
Mathematical formulas in clear notation
Real-world worked examples with sample data
Implementation checklist
Common pitfalls and mitigation strategies

Example Scenario

A customer journey consists of: Paid Search Click → Display Ad View → Email Click → Website Visit → Conversion (Revenue: $500). Channel costs: Paid Search $20, Display $15, Email $5. Using each weighting model, calculate revenue attribution and ROI by channel.

Use Corpus-In-Context (CiC) prompting patterns if you have access to reference documents on attribution methodologies. Provide sequential, numbered references to any supporting materials or external frameworks you incorporate.

Begin your specification now:

Try in Prompt Builder

How to Customize These Prompts

Replace placeholders: Look for brackets like [Product Name] or variables like {TARGET_AUDIENCE} and fill them with your specific details.
Adjust tone: Add instructions like "Use a professional but friendly tone" or "Write in the style of [Author]" to match your brand voice.
Refine outputs: If the result isn't quite right, ask for revisions. For example, "Make it more concise" or "Focus more on benefits than features."
Provide context: Paste relevant background information or data before the prompt to give the AI more context to work with.

Frequently Asked Questions

Why use Gemini for analysis tasks?

Gemini excels at analysis tasks due to its strong instruction-following capabilities and consistent output formatting. It produces reliable, structured results that work well for professional analysis workflows.

How do I customize these prompts for my specific needs?

Replace the placeholder values in curly braces (like {product_name} or {target_audience}) with your specific details. The more context you provide, the more relevant the output.

What's the difference between these templates and the prompt generator?

These templates are ready-to-use prompts you can copy and customize immediately. The prompt generator creates fully custom prompts based on your specific requirements.

Can I use these prompts with other AI models?

Yes, these prompts work with most AI models, though they're optimized for Gemini's specific strengths. You may need minor adjustments for other models.

Best Data Analysis Prompts for Gemini (2026)

15 Best Data Analysis s for Gemini (2026) Prompt Templates

Assessment Framework

1. Completeness Analysis

2. Data Type Consistency

3. Duplicate Detection

4. Anomaly Detection

5. Severity Scoring

Output Structure

Processing Instructions

1. Feature Engineering Strategy

2. Exploratory Data Analysis Requirements

3. Model Selection Rationale

4. Hyperparameter Tuning Strategy

5. Evaluation Metrics and Success Criteria

6. Pipeline Implementation Checklist

1. Dashboard Overview

2. Key Performance Indicators (KPIs)

3. Visualization Specifications

4. Drill-Down Analysis Paths

5. Data Quality & Governance

6. Export & Sharing Capabilities

Customer Churn Analysis System

Analysis Framework

Output Structure

Analysis Depth

Analysis Structure

Output Format

Key Requirements

Analysis Framework

Output Format

Context

Task

Specification Sections

1. Touchpoint Tracking Framework

2. Contribution Weighting Models

3. Channel Impact Analysis

4. ROI Calculation Methodology

5. Implementation Considerations

Output Format

Example Scenario

How to Customize These Prompts

Frequently Asked Questions

Related Resources

Gemini Prompt Generator

Gemini Prompt Templates

Data Analysis Prompts (All Models)

Grok Prompt Templates

Claude Prompt Templates

ChatGPT Prompt Templates

Need a Custom Data Analysis Prompt?