Meet the Analytics Lake

Data, metadata, high performance caching, custom services, and analytics all in one platform!

Request a demo Live demo + Q&A
Tap to enlarge Tap to minimize

What is an Analytics Lake?

The Analytics Lake forms a 'composable data service architecture' layer that combines the best in open-source tech and our semantic and analytics layers, to give both human and machine consumers a single place to find all of their analytics assets.

Simplify data access with a unified service layer

No more moving data icon

No more moving data

Reduce data duplication and latency, and ensure real-time access to the most current data where it sits.

Tools to connect and orchestrate icon

Tools to connect and orchestrate

S3 for business intelligence, TimescaleDB for real-time analytics, Snowflake for data warehousing, and Databricks for data science.

Efficient Data Management icon

Efficient Data Management

Ensure that each tool and storage solution delivers the most value without unnecessary data movement or complexity.

Benefits of the Analytics Lake

Use Analytics Lake through your favorite interface — Python for data scientists, React for application developers, SQL for data engineers and, no code/low code and chat interfaces for business users.

Icon
Save money with advanced caching
Super-fast, in-memory computation on large volumes of data — on full or partial data sets — for reusability.
Icon
Transform and federate with agility
Combine data from your apps via SQL or APIs. Materialize data in-memory or on disk — query live from your databases.
Icon
Built on open source tools
Built on tools including Apache Arrow, Iceberg, and DuckDB, pre-configured for core BI use cases.
Icon
Power new use cases
Develop your own modules to meet data science and engineering needs directly in your analytics platform.

Introducing FlexConnect

Write custom self-service integrations to any data source.

Integrate niche or homegrown data sources rapidly, reducing time to value and empowering your team to immediately leverage unique data without bottlenecks.

Build custom data connections

Allow your team to independently build and incorporate any data source into GoodData’s data models, reducing dependency on external support and speeding up deployments.

Designed for self-service control

Connect to virtually any system or data source, including niche or homegrown solutions, using familiar SQL and API standards. This flexibility helps incorporate diverse data assets without complex integration projects.

Leverage SQL and APIs

Embed advanced transformations, data federation, or ML tasks within the connection itself, enabling real-time data insights and a streamlined analytics workflow.

Embedded run-time logic

By embedding business logic and governance mechanisms directly within the data source connection, FlexConnect ensures consistent, trusted data flows and robust security, enhancing the quality and reliability of your insights.

Enhanced control and user trust
Build custom data connections
Designed for self-service control
Leverage SQL and APIs
Embedded run-time logic
Enhanced control and user trust

The unique purpose of an Analytics Lake

The Analytics Lake is designed to integrate with raw data from a data lake and structured data from a data warehouse to provide advanced analytics, comprehensive business intelligence, and accurate predictive insights, driving better decision-making and outcomes.

Analytics Lake
Solution
Primary Focus
Scalable raw data storageBusiness reporting and decision-makingAnalytics and machine learning
Storage Format & Optimization
Optimized for low-cost storage of large raw, unstructured dataStructured data optimized for query performanceOptimized compute for ML workflows
Metadata & Semantic Layers
Limited metadata and semanticsTypically has integrated business metadataIntegrated metadata and headless semantic layer
BI & Visualization Capabilities
Minimal BI supportFull support for BI workflows and visualizationsIntegrated BI and visualizations environment
Advance Analytics Support
Strong support for advanced analyticsBasic predictive modeling capabilitiesOptimized for advanced analytics and ML
Data Science & ML Support
Supports data science through big data toolsLimited support for data science workflowsTailored for data science and ML with compute optimization and APIs
Compliance & Governance
Limited governance capabilitiesStrong auditing, security, and governance capabilitiesRobust governance through metadata integration
APIs & Programmatic Access
Access via big data APIs and notebooksTraditionally accessed through SQLAPIs designed specifically for analytics engineers
Cost Efficiency
Very low-cost storage but unpredictable analytics costsPredictable but relatively high storage costsOptimized processing reduces overall costs
Flexibility & Future Proofing
Flexible but typically requires migration for BI useSchema-on-write limits flexibilityDesigned to interoperate with multiple data platforms
Key Strengths
Scalability, cost efficiency for storagePerformance, consistency, reliabilityPurpose-built for advanced analytics and ML
Key Limitations
Challenging to apply governance, reuse data for BI and reportingLimited flexibility and ability to handle messy, large or streaming dataEmerging architecture
SolutionData LakeData WarehouseAnalytics Lake
Primary FocusScalable raw data storageBusiness reporting and decision-makingAnalytics and machine learning
Storage Format & OptimizationOptimized for low-cost storage of large raw, unstructured dataStructured data optimized for query performanceOptimized compute for ML workflows
Metadata & Semantic LayersLimited metadata and semanticsTypically has integrated business metadataIntegrated metadata and headless semantic layer
BI & Visualization CapabilitiesMinimal BI supportFull support for BI workflows and visualizationsIntegrated BI and visualizations environment
Advance Analytics SupportStrong support for advanced analyticsBasic predictive modeling capabilitiesOptimized for advanced analytics and ML
Data Science & ML SupportSupports data science through big data toolsLimited support for data science workflowsTailored for data science and ML with compute optimization and APIs
Compliance & GovernanceLimited governance capabilitiesStrong auditing, security, and governance capabilitiesRobust governance through metadata integration
APIs & Programmatic AccessAccess via big data APIs and notebooksTraditionally accessed through SQLAPIs designed specifically for analytics engineers
Cost EfficiencyVery low-cost storage but unpredictable analytics costsPredictable but relatively high storage costsOptimized processing reduces overall costs
Flexibility & Future ProofingFlexible but typically requires migration for BI useSchema-on-write limits flexibilityDesigned to interoperate with multiple data platforms
Key StrengthsScalability, cost efficiency for storagePerformance, consistency, reliabilityPurpose-built for advanced analytics and ML
Key LimitationsChallenging to apply governance, reuse data for BI and reportingLimited flexibility and ability to handle messy, large or streaming dataEmerging architecture

Learn more

Learn more about the Analytics Lake vision

Icon
FlexQuery
FlexQuery analytics cache powers the GoodData’s Analytics Lake Learn more about FlexQuery
Icon
FlexCache Calculator
Want to see how much lower your cloud data warehouse costs could be? Try our calculator
Icon
Documentation
Get into the technical specifications Read our documentation
Icon
Product
See how we can help your analytics goals Discover our Product

Find out how the Analytics Lake can improve your Data Management

Request a demo Live demo + Q&A