Dataaaaa !

One platform to bring them all together

Today

USA Bobsled/Skeleton Partners with Snowflake Intelligence

Snowflake partners with USA Bobsled/Skeleton to enhance performance using data analytics and AI, leveraging Snowflake Intelligence for real-time insights.

Snowflake-managed MCP server (General availability)

Snowflake announces the general availability of its managed MCP server, providing standardized integration and robust governance for AI agents.

Why we're making nao Free

nao offers a free version of its data IDE with integrated AI features, letting users connect data warehouses and execute SQL.

Snowflake Intelligence (General availability)

Snowflake announces the general availability of Snowflake Intelligence, a powerful tool for analyzing structured and unstructured data.

Meet the founders of Columnar: Ian Cook, David Li, and Matt Topol

Columnar, founded by Apache Arrow contributors, introduces Arrow-native ADBC drivers to enhance data connectivity for platforms like Snowflake and DuckDB.

Cortex Agents (General availability)

Snowflake announces the general availability of Cortex Agents, a tool for orchestrating structured and unstructured data using LLMs and key components like planning and analysis.

Snowflake Machine Learning Experiments (Preview)

Snowflake introduces machine learning experiments to track and evaluate models through Snowsight, enabling comparison of collected data to select the best model.

Meet the founders of dltHub: Matthaus Krzykowski, Marcin Rudolf, Adrian Brudaru, and Anna Hoffmann

dltHub develops a Python-native data platform to accelerate data pipelines, combining simplicity with enterprise-grade governance.

Yesterday

FlinkSketch: Democratizing the Benefits of Sketches for the Flink Community

FlinkSketch is a library of sketching algorithms for Flink, enabling various streaming analytics capabilities through efficient algorithms.

Chat with your Snowflake Data from Microsoft Teams

Integration of Snowflake Cortex with Microsoft Teams to interact with data via a bot, using AI agents and semantic views.

Apache Arrow's Final Frontier: Replacing Outdated Database Drivers

Apache Arrow aims to replace outdated database drivers, enhancing interoperability and performance of modern data systems.

marimo: A reactive notebook for Python

marimo is a reactive Python notebook for running reproducible experiments, querying with SQL, executing as a script, deploying as an app, and versioning with git.

LLM Client, Server API and UI

Lightweight tool to access multiple LLMs, with multi-provider support, an OpenAI-compatible API, and a UI. Features include cost analysis, configuration management, and Docker support.

Faster root cause for slow traces with ClickStack Event Deltas

ClickStack Event Deltas speeds up root cause analysis for slow traces by automatically comparing attributes of fast and slow traces, leveraging ClickHouse for high-performance observability.

"You Don't Need Kafka, Just Use Postgres" Considered Harmful

Technical comparison between Kafka and Postgres for real-time data processing, highlighting Kafka's advantages for event streaming.

BigQuery : The Data Engineering Agent is now in preview

BigQuery introduces a Data Engineering Agent in preview, automating complex tasks.

Global weather data from flying airplanes

Using ClickHouse to analyze weather data from airplane telemetry, leveraging color space conversion functions and trigonometric calculations.

Sunday, November 2

shed: CLI to manage your SQL database schemas and migrations

CLI tool for database schema management using SQLModel and Alembic, with JSON-schema export for Pydantic.

Saturday, November 1

Graph RAG vs SQL RAG

Comparison of RAG performance on graph and SQL databases using a Formula 1 results dataset.

FastMCP 2.13: Storage, Security, and Scale

FastMCP 2.13 introduces persistent storage, robust authentication, and performance optimizations for production MCP servers.

Friday, October 31

BigQuery : October 31, 2025

Increased row capacity for pivot tables in Connected Sheets from 100,000 to 200,000 rows.

Organization-level findings in the Trust Center

Snowflake announces security features in Trust Center to analyze violations at the organization level.

Cool stuff Google Cloud customers built, Oct. edition: Research agents, agentic “teams,” decentralized contracts & more

Showcase of Google Cloud customer projects: AI research agents for Deutsche Bank, migration to CloudSQL for Rent the Runway, and AI assistants for Seattle Children's and FOX Sports.

Optimize Storage Costs and Simplify Compliance with Storage Lifecycle Policies, Now Generally Available

Snowflake announces general availability of storage lifecycle policies to optimize costs and simplify compliance.

Turn Data Into Intelligence In Your Everyday Workflows

Snowflake Cortex Agents simplifies AI-powered data interactions within Microsoft 365 Copilot and Teams, enabling analysis and insight generation from structured and unstructured data.

Thursday, October 30

Snowflake Data Clean Rooms updates

Snowflake updates Data Clean Rooms with UI enhancements, API improvements, and better error messaging.

4 Senior Data Engineers Answer 10 Top Reddit Questions

Four senior data engineers address Reddit's top questions on fundamentals, data quality, and tech choices.

Data transformation in the data warehouse

This post explores the importance of data transformation in data warehouses and how dbt facilitates the creation of reliable, scalable data pipelines.

Exploring how PostgreSQL 18 conquered time with temporal constraints

PostgreSQL 18 introduces temporal constraints with WITHOUT OVERLAPS and PERIOD to enhance temporal data integrity.

BigQuery : Apache Iceberg REST catalog in BigLake metastore now generally available

The Apache Iceberg REST catalog in BigLake metastore is now generally available with new features.

Machine-learning predictive autoscaling for Flink

Grab uses machine learning for predictive autoscaling of Flink applications, optimizing CPU usage and reducing costs.

Dagster 1.12: Monster Mash

Dagster 1.12 enhances user experience with a streamlined UI, GA Components, simplified deployments, and orchestration improvements like FreshnessPolicies.

Getting started with an ELT pipeline

Explores ELT pipelines and their scalable design, highlighting dbt's role in simplifying collaboration and data transformation.

Announcing Expanded Integration Between Oracle Database and the Snowflake AI Data Cloud

Snowflake announces expanded integration with Oracle Database to enhance data connectivity and analytics.

Improve logs compression with log clustering

This post demonstrates using log clustering with Drain3 and ClickHouse UDFs to automatically structure raw application logs, achieving nearly 50x compression.

Why You’ll Never Have a FAANG Data Infrastructure and That’s the Point | Part 1

Analysis of FAANG data infrastructures, highlighting their design philosophies rather than tools, and proposing a hybrid approach for non-FAANG organizations.

Wednesday, October 29

CLIENT_POLICY parameter for authentication policies

Snowflake introduces the CLIENT_POLICY parameter to set minimum client versions in authentication policies.

Announcing Columnar

Launch of Columnar, a new open-source platform for data management.

BigQuery : Reservation groups for idle slot prioritization

BigQuery now allows grouping reservations to prioritize idle slot sharing within a group, providing better control over slot allocation for high-priority workloads.

Introducing dbc

Introduction to dbc, a command-line tool that manages connections and executes SQL queries.

dbt Labs Open Sources MetricFlow: An Independent Schema for Data Interoperability

dbt Labs open sources MetricFlow, an independent schema for data interoperability, enhancing consistency and collaboration in data pipelines.

Snowflake Native Apps: Shareback

Snowflake Native Apps can now securely request permission from consumers to share data back with the provider.

The Case Against PGVector

Analysis of operational challenges and limitations of pgvector in production, highlighting indexing issues, real-time search problems, and filtering complexities.

Tuesday, October 28

We built a vector search engine that lets you choose precision at query time

ClickHouse introduces QBit, a column type for storing floats as bit planes, enabling adjustable precision and performance for vector searches at query time.

Faster Ducks

MotherDuck showcases 20% performance improvements with DuckDB 1.4, outperforming Snowflake and Redshift in cost and speed according to ClickBench.

BigQuery : October 28, 2025

BigQuery Data Transfer Service supports new data sources. Subscriber email logging is now available.

Snowflake Cortex AI Achieves FedRAMP Moderate on AWS Region US East (N. Virginia)

Snowflake Cortex AI achieves FedRAMP Moderate authorization on AWS US East region, enhancing security and compliance for government users.

Snowflake and Workday Accelerate the AI-Driven Enterprise

Partnership between Snowflake and Workday to accelerate AI adoption in the enterprise.

Integrating Oracle with Google Cloud for AI automation

Integrates Oracle with Google Cloud for AI automation, using Datastream for data replication and BigQuery for advanced analytics and AI.

Snowflake Security Innovations for a Trusted AI Data Cloud

Snowflake introduces security innovations for a trusted AI Data Cloud, focusing on data protection and secure AI integration.
