mamot.fr is one of the many independent Mastodon servers you can use to participate in the fediverse.
Mamot.fr est un serveur Mastodon francophone, géré par La Quadrature du Net.

Server stats:

3.1K
active users

#datawarehouse

3 posts1 participant0 posts today
Francis 🏴‍☠️ Gulotta<p>I’ve been working on a pretty gnarly data a warehouse reporting problem for the past few days. It’s up, leveling my ability to do this kind of work. The tooling has always been so limited and I am beginning to understand it is me who is limited in the understanding of the tooling ecosystem.</p><p>There may or may not be a wonderful overlap of programming and data warehousing but it’s clear that me not being aware of it doesn’t mean it doesn’t exist.</p><p><a href="https://toot.cafe/tags/DataWarehouse" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataWarehouse</span></a> <a href="https://toot.cafe/tags/bigdata" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>bigdata</span></a> <a href="https://toot.cafe/tags/reporting" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>reporting</span></a></p>
Justin Buzzard<p>A Data Lake in the software world is essentially where raw data is taken and turned into something tangible like reports, often using AI/machine learning and them put into the Data Warehouse. <a href="https://mastodon.social/tags/software" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>software</span></a> <a href="https://mastodon.social/tags/datalake" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>datalake</span></a> <a href="https://mastodon.social/tags/datawarehouse" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>datawarehouse</span></a></p>
Renne Rocha<p>I just discovered that Snowflake (the company) has its name not because it makes it possible to create a beautiful logo of a snowflake, but because Snowflake Schema is a pattern for storing information in Data Warehouses (we also have Star Schema).</p><p><a href="https://chaos.social/tags/TIL" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>TIL</span></a> <a href="https://chaos.social/tags/DataWarehouse" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataWarehouse</span></a></p>
Vic<p>An analysis of 100 Fortune 500 job postings reveals the tools and technologies shaping the data engineering field in 2025. Top skills in demand:<br>⁕ Programming Languages (196) - SQL (85), Python (76), Scala (14), Java (14)<br>⁕ ETL and Data Pipeline (136) - ETL (65), Data Integration (46)<br>⁕ Cloud Platforms (85) - AWS (45), GCP (26), Azure (14)<br>⁕ Data Modeling and Warehousing (83) - Data Modeling (40), Data Warehousing (22), Data Architecture (21)<br>⁕ Big Data Tools (67) - Spark (40), Big Data Tools (19), Hadoop (8)<br>⁕ DevOps, Version Control, and CI/CD (52) - Git (14), CI/CD (13), DevOps (7), Version Control (6), Terraform (6)<br>...</p><p><a href="https://techhub.social/tags/DataEngineering" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataEngineering</span></a> <a href="https://techhub.social/tags/BigData" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BigData</span></a> <a href="https://techhub.social/tags/SQL" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SQL</span></a> <a href="https://techhub.social/tags/Python" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Python</span></a> <a href="https://techhub.social/tags/ETL" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ETL</span></a> <a href="https://techhub.social/tags/AWS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AWS</span></a> <a href="https://techhub.social/tags/CloudComputing" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>CloudComputing</span></a> <a href="https://techhub.social/tags/Spark" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Spark</span></a> <a href="https://techhub.social/tags/DataModeling" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataModeling</span></a> <a href="https://techhub.social/tags/DataWarehouse" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataWarehouse</span></a> <a href="https://techhub.social/tags/DevOps" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DevOps</span></a> <a href="https://techhub.social/tags/DataGovernance" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataGovernance</span></a> <a href="https://techhub.social/tags/DataVisualization" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataVisualization</span></a> <a href="https://techhub.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>MachineLearning</span></a> <a href="https://techhub.social/tags/API" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>API</span></a> <a href="https://techhub.social/tags/Scala" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scala</span></a> <a href="https://techhub.social/tags/Java" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Java</span></a> <a href="https://techhub.social/tags/GCP" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>GCP</span></a> <a href="https://techhub.social/tags/Azure" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Azure</span></a> <a href="https://techhub.social/tags/Hadoop" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Hadoop</span></a> <a href="https://techhub.social/tags/Git" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Git</span></a> <a href="https://techhub.social/tags/CICD" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>CICD</span></a> <a href="https://techhub.social/tags/Terraform" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Terraform</span></a> <a href="https://techhub.social/tags/DataQuality" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataQuality</span></a> <a href="https://techhub.social/tags/Tableau" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Tableau</span></a> <a href="https://techhub.social/tags/PowerBI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>PowerBI</span></a> <a href="https://techhub.social/tags/Collaboration" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Collaboration</span></a> <a href="https://techhub.social/tags/Microservices" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Microservices</span></a> <a href="https://techhub.social/tags/MLOps" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>MLOps</span></a> <a href="https://techhub.social/tags/TechSkills" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>TechSkills</span></a></p><p><a href="https://www.reddit.com/r/dataengineering/comments/1hz5ytw/become_a_data_engineer_in_2025_based_on_100_jobs/?utm_source=perplexity&amp;rdt=54709" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">reddit.com/r/dataengineering/c</span><span class="invisible">omments/1hz5ytw/become_a_data_engineer_in_2025_based_on_100_jobs/?utm_source=perplexity&amp;rdt=54709</span></a></p>
Anoncheg<p>Part1: <a href="https://techhub.social/tags/dailyreport" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>dailyreport</span></a> <a href="https://techhub.social/tags/powerbi" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>powerbi</span></a> <a href="https://techhub.social/tags/datawarehouse" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>datawarehouse</span></a> <a href="https://techhub.social/tags/dwh" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>dwh</span></a> <a href="https://techhub.social/tags/postgresql" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>postgresql</span></a><br> <a href="https://techhub.social/tags/python" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>python</span></a><br>At this week I installed PowerBI and connect it to remote<br> PostgreSQL.<br>I asked AI to compare open-source data sources for<br> PowerBI and compare them by:<br>- Ease of Setup on Linux: SQLite &gt; PostgreSQL &gt; MySQL &gt;<br> Redis &gt; MongoDB<br>- Performance:<br> + For large datasets: MongoDB &gt; PostgreSQL &gt; MySQL &gt;<br> Redis &gt; SQLite.<br> + For real-time operations: Redis &gt; MongoDB &gt; MySQL &gt;<br> PostgreSQL &gt; SQLite.</p><p>For PostgreSQL I prepare data in Python script that use:<br>- pandas - for coverting types to datetime and numeric<br>- sqlalchemy - for simplifying type converstion<br>- asyncpg - sqlalchemy backend to connect to PostgreSQL</p>
Martin De Wulf<p>I love the vocabulary in data science: datalakehouse ? warecatalog? workflowlines? </p><p>Anything is possible!</p><p><a href="https://mastodon.social/tags/data" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>data</span></a> <a href="https://mastodon.social/tags/datawarehouse" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>datawarehouse</span></a></p>
Sarah Lea<p>One of the most highlighted parts: "There is no need to move data. Data latency is minimised. Data can be transformed and analysed within a single platform.“</p><p>This is one of the reasons for 'Why ETL-Zero' :blobcoffee: </p><p><a href="https://towardsdatascience.com/why-etl-zero-understanding-the-shift-in-data-integration-as-a-beginner-d0cefa244154" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">towardsdatascience.com/why-etl</span><span class="invisible">-zero-understanding-the-shift-in-data-integration-as-a-beginner-d0cefa244154</span></a></p><p><a href="https://techhub.social/tags/data" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>data</span></a> <a href="https://techhub.social/tags/datascience" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>datascience</span></a> <a href="https://techhub.social/tags/dataanalysis" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>dataanalysis</span></a> <a href="https://techhub.social/tags/dataanalytics" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>dataanalytics</span></a> <a href="https://techhub.social/tags/DataEngineering" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataEngineering</span></a> <a href="https://techhub.social/tags/sql" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>sql</span></a> <a href="https://techhub.social/tags/salesforce" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>salesforce</span></a> <a href="https://techhub.social/tags/etl" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>etl</span></a> <a href="https://techhub.social/tags/datawarehouse" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>datawarehouse</span></a> <a href="https://techhub.social/tags/datalake" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>datalake</span></a> <a href="https://techhub.social/tags/datalakehouse" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>datalakehouse</span></a> <a href="https://techhub.social/tags/programming" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>programming</span></a></p>
Sarah Lea<p>In a data warehouse you store structured &amp; organized data. In a data lake you can additionally store unstructured data. And was is now a data lakehouse? </p><p>Think of a combination of the strengths of both previous data platforms. :blobcoffee: </p><p><a href="https://towardsdatascience.com/sql-and-data-modelling-in-action-a-deep-dive-into-data-lakehouses-fcbab9a4b9c2" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">towardsdatascience.com/sql-and</span><span class="invisible">-data-modelling-in-action-a-deep-dive-into-data-lakehouses-fcbab9a4b9c2</span></a></p><p><a href="https://techhub.social/tags/data" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>data</span></a> <a href="https://techhub.social/tags/DataEngineering" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataEngineering</span></a> <a href="https://techhub.social/tags/datalakehouse" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>datalakehouse</span></a> <a href="https://techhub.social/tags/datacenters" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>datacenters</span></a> <a href="https://techhub.social/tags/datawarehouse" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>datawarehouse</span></a> <a href="https://techhub.social/tags/datalake" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>datalake</span></a> <a href="https://techhub.social/tags/datascience" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>datascience</span></a> <a href="https://techhub.social/tags/sql" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>sql</span></a></p>
dominucco<p>Data Lakes, Data Silos, Data Warehouses, OH MY! </p><p>Don't let siloed data slow your business down. Let Alice bring it all together!</p><p><a href="https://mastodon.social/tags/ETL" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ETL</span></a> <a href="https://mastodon.social/tags/automation" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>automation</span></a> <a href="https://mastodon.social/tags/datawarehouse" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>datawarehouse</span></a> </p><p><a href="https://www.youtube.com/watch?v=-ip-BTFX25o" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">youtube.com/watch?v=-ip-BTFX25</span><span class="invisible">o</span></a></p>