mastodon.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
The original server operated by the Mastodon gGmbH non-profit

Administered by:

Server stats:

297K
active users

#datascience

152 posts92 participants1 post today

Explore the Future of #AI & #DataScience at #FOSSASIASummit2025

Join the AI & Data Science Track and gain insights from top AI researchers, developers, and industry leaders from AWS, Google, Fujitsu, Mercari, and more!

🔥Open Source AI & real-world applications
🔥 Deep dive into LLMs & ChatGPT’s architecture
🔥 Next-gen search technologies & large-scale AI solutions

📍 March 13-15, 2025 | True Digital Park, Bangkok
🎟️ Check it out! eventyay.com/e/4c0e0c27/schedu

Continued thread

As a process, ETL takes the data from the lake to the warehouse for BI and other reporting systems. The greatest / worst thing about data lakes is that they store data in its raw form. Hence, the importance of good ETL practices... even though systems rely on raw data for insights.

Lakehouses make it a bit better by enabling warehouse level governance at the lake level. Kinda like a lake with an auto-ETL layer

datatofu.wordpress.com

Tags:

Digestible Data Analytics (DDA)Digestible Data Analytics (DDA)Serving you digestible big data analysis and analytics systems.

📢 QuadratiK is now part of pyOpenSci! 🎉
QuadratiK makes it easier to analyze and compare data with:
✅ Goodness-of-Fit tests – check if your data follows a pattern
✅ Clustering – find natural groups in your data
✅ Sampling tools – create new data based on probability models
✅ Visualization tools – make sense of results easily
✅ A dashboard – so you don’t need to code!
📖 Read more: pyopensci.org/blog/quadratik.h

pyOpenSci · QuadratiK: Collection of Methods Constructed using Kernel-Based Quadratic DistancesQuadratiK provides a set of goodness-of-fit tests, a clustering technique using kernel-based quadratic distances, and algorithms for generating random samples from Poisson kernel-based distributions (PKBD). QuadratiK has recently been accepted into the pyOpenSci ecosystem.

🌟 Recording of our last R-Ladies Cologne meeting is now available! 🌟
In this session, Sébastien Rochette introduced his package creation process with Fusen—a powerful tool for R users. If you missed the meeting or want to review it, check out the recording!
📽️ youtu.be/wAkZvwPK1P4?feature=s
#RLadies #RStats #Fusen #DataScience

I'm heading few of data science ppl who could be tigers, but they think they can only be kittens. 😐

But ya the company hasn't given them any real projects, may be due to ultra weak marketing, or due to domain ignorance.

So I'm appealing to the world, does anyone want Data Science, Artificial intelligence, and machine learning projects to be done @ dirt cheap rates?

It will be used to uplift the life of people who should be A-class engineers.

Clemson News: The hidden impact of data on gas prices. “With apps and websites providing access to competitors’ prices, gas station owners can adjust their pricing strategies without having to drive around and collect data physically. However, [Professor Matthew] Lewis’ research suggests that this newfound transparency doesn’t necessarily benefit consumers in the way one might expect.”

https://rbfirehose.com/2025/02/12/clemson-news-the-hidden-impact-of-data-on-gas-prices/

ResearchBuzz: Firehose | Individual posts from ResearchBuzz · Clemson News: The hidden impact of data on gas prices | ResearchBuzz: Firehose
More from ResearchBuzz: Firehose
Continued thread

Even if we consider immigration as a special use case; immigration control could target finding forgeries, criminals, illegal immigrants, and prohibited persons. Applying it to a country’s borders could be a prudent application. The necessary data could come from passport photos and other passport metadata, such as the passport code location. The catch? ... a local problem could turn global.

datatofu.wordpress.com

Tags:

Digestible Data Analytics (DDA)Digestible Data Analytics (DDA)Serving you digestible big data analysis and analytics systems.
Replied in thread

@hendrik @pandasiusfilet

Hey there! 😊 Sorry to bother you, but I’m considering upgrading my homelab with some P40 or M100 GPUs. I mainly need more VRAM for my projects, and I was really inspired by the channel "Homelab AI" and their video on building a DIY 4x Nvidia P40 homeserver with 96GB of VRAM! If you haven’t seen it yet, you can check it out here: DIY 4x Nvidia P40 Homeserver for A youtu.be/dHTvpUlWFbk . I’d love to hear your thoughts or any tips you might have? Thanks a lot! 🙌 #ai #machinelearning #artificialintelligence #Homelab #AI #Nvidia #GPUs #P40 #M100 #VRAM #DIY #TechProjects #MachineLearning #DataScience #ServerBuild #TechInspiration #HomeServer #CloudComputing

YouTubeDIY 4x Nvidia P40 Homeserver for AI with 96gb VRAM!We've built a homeserver for AI experiments, featuring 96 GB of VRAM and 448 GB of RAM, with an AMD EPYC 7551P processor. We'll be testing our Tesla P40 GPUs...