31/01/2025
Are you ready to advance your skills in Data Science and AI? Join the DeepTech_Ready Upskilling Programme, facilitated by https://datasciencenigeria.org/ and supported by https://www.google.org/
DeepTech_Ready combines a localized learning curriculum, guidance from expert career mentors, and a robust job-matching network to ensure you are job-ready for the global market in just 6 months!
Specializations Offered:
Advanced Data Analysis and Visualization
Data Science/Machine Learning
Data Architecture
Geospatial Data Science
Computer Vision
Natural Language Processing
Advanced Machine Learning Techniques
Apply here: https://app.3mtt.training/apply
Like, Share & Subscribe
29/01/2025
Clustered and non-clustered indexes are vital components of database management systems (DBMS), each significantly influencing performance and data retrieval.
Both types of indexes are crucial for optimizing database performance. The selection and design of indexes should be tailored to specific use cases, query patterns, and data characteristics to achieve an optimal balance between read and write performance.
In this newsletter, I have clearly outlined the key differences between clustered and non-clustered indexes. It is essential for every data professional, particularly those involved in data modeling for databases and data warehouses, to have a deep understanding of these differences. This knowledge enables them to apply the appropriate indexing strategies based on their specific use cases.
Key differences between clustered and non-clustered indexes | Soyoola Sodunke
In database management systems, indexes are used to speed up the retrieval of data from tables. The two main types of indexes are clustered and non-clustered indexes.
27/01/2025
At its core, AI is about learning from data and making intelligent decisions or predictions. The quality of data, the sophistication of algorithms, and computational power all play a role in how effective an AI system is.
23/01/2025
In the city of Ekom, there lived a curious data analyst named Alex, who was often called upon to solve complex data mysteries.
One day, he received a challenge from the CEO of Breez Inc., a thriving e-commerce company. Sales were fluctuating, and customer complaints were piling up. “We have all the data,” the CEO sighed, “but we don’t know what it’s telling us.”
Determined to crack the case, Alex dove into Breez’s database, which held two crucial tables:
𝐂𝐮𝐬𝐭𝐨𝐦𝐞𝐫𝐬 𝐓𝐚𝐛𝐥𝐞 (CustomerID, Name, Email)
𝐎𝐫𝐝𝐞𝐫𝐬 𝐓𝐚𝐛𝐥𝐞 (OrderID, CustomerID, Product, Amount, Date)
As Alex stared at the tables, they were just scattered pieces of the puzzle. The real story would only emerge by bringing them together.
Alex thought, “𝐿𝑒𝑡’𝑠 𝑠𝑡𝑎𝑟𝑡 𝑏𝑦 𝑙𝑜𝑜𝑘𝑖𝑛𝑔 𝑎𝑡 𝑐𝑢𝑠𝑡𝑜𝑚𝑒𝑟𝑠 𝑤ℎ𝑜 ℎ𝑎𝑣𝑒 𝑚𝑎𝑑𝑒 𝑝𝑢𝑟𝑐ℎ𝑎𝑠𝑒𝑠.”
Using an 𝐈𝐍𝐍𝐄𝐑 𝐉𝐎𝐈𝐍, Alex linked the Customers and Orders tables on the common 𝙲𝚞𝚜𝚝𝚘𝚖𝚎𝚛𝙸𝙳 column. This revealed only customers who had placed orders, allowing Alex to analyze spending patterns and identify top buyers.
𝐓𝐢𝐩
𝐼𝑁𝑁𝐸𝑅 𝐽𝑂𝐼𝑁 ℎ𝑒𝑙𝑝𝑠 𝑤ℎ𝑒𝑛 𝑦𝑜𝑢 𝑛𝑒𝑒𝑑 𝑜𝑛𝑙𝑦 𝑡ℎ𝑒 𝑟𝑒𝑙𝑒𝑣𝑎𝑛𝑡 𝑚𝑎𝑡𝑐ℎ𝑒𝑠—𝑝𝑒𝑟𝑓𝑒𝑐𝑡 𝑓𝑜𝑟 𝑝𝑒𝑟𝑓𝑜𝑟𝑚𝑎𝑛𝑐𝑒 𝑎𝑛𝑎𝑙𝑦𝑠𝑖𝑠 𝑎𝑛𝑑 𝑡𝑎𝑟𝑔𝑒𝑡𝑖𝑛𝑔 𝑎𝑐𝑡𝑖𝑣𝑒 𝑐𝑢𝑠𝑡𝑜𝑚𝑒𝑟𝑠.
Next, Alex wondered, “𝑊ℎ𝑎𝑡 𝑎𝑏𝑜𝑢𝑡 𝑐𝑢𝑠𝑡𝑜𝑚𝑒𝑟𝑠 𝑤ℎ𝑜 ℎ𝑎𝑣𝑒𝑛’𝑡 𝑜𝑟𝑑𝑒𝑟𝑒𝑑 𝑦𝑒𝑡? 𝑊𝑒 𝑚𝑖𝑔ℎ𝑡 𝑏𝑒 𝑚𝑖𝑠𝑠𝑖𝑛𝑔 𝑜𝑢𝑡 𝑜𝑛 𝑝𝑜𝑡𝑒𝑛𝑡𝑖𝑎𝑙 𝑏𝑢𝑠𝑖𝑛𝑒𝑠𝑠.”
A 𝐋𝐄𝐅𝐓 𝐉𝐎𝐈𝐍 was used to get all customers, including those who hadn't made any purchases. Sure enough, dozens of customer names appeared with NULL values in the orders section.
𝐓𝐢𝐩
𝐿𝐸𝐹𝑇 𝐽𝑂𝐼𝑁 𝑖𝑠 𝑡ℎ𝑒 𝑤𝑎𝑦 𝑡𝑜 𝑔𝑜 𝑤ℎ𝑒𝑛 𝑦𝑜𝑢 𝑤𝑎𝑛𝑡 𝑎𝑙𝑙 𝑟𝑒𝑐𝑜𝑟𝑑𝑠 𝑓𝑟𝑜𝑚 𝑜𝑛𝑒 𝑠𝑖𝑑𝑒, 𝑒𝑣𝑒𝑛 𝑖𝑓 𝑡ℎ𝑒𝑟𝑒 𝑎𝑟𝑒 𝑛𝑜 𝑐𝑜𝑟𝑟𝑒𝑠𝑝𝑜𝑛𝑑𝑖𝑛𝑔 𝑒𝑛𝑡𝑟𝑖𝑒𝑠 𝑜𝑛 𝑡ℎ𝑒 𝑜𝑡ℎ𝑒𝑟 𝑠𝑖𝑑𝑒—𝑔𝑟𝑒𝑎𝑡 𝑓𝑜𝑟 𝑖𝑑𝑒𝑛𝑡𝑖𝑓𝑦𝑖𝑛𝑔 𝑔𝑎𝑝𝑠 𝑎𝑛𝑑 𝑢𝑛𝑡𝑎𝑝𝑝𝑒𝑑 𝑜𝑝𝑝𝑜𝑟𝑡𝑢𝑛𝑖𝑡𝑖𝑒𝑠.
While analyzing supplier data, Alex used a 𝐑𝐈𝐆𝐇𝐓 𝐉𝐎𝐈𝐍 to check which suppliers had provided products, even if some weren’t linked to current sales. This helped pinpoint inventory gaps and product redundancies.
𝐓𝐢𝐩
𝑅𝐼𝐺𝐻𝑇 𝐽𝑂𝐼𝑁 ℎ𝑒𝑙𝑝𝑠 𝑤ℎ𝑒𝑛 𝑦𝑜𝑢𝑟 𝑓𝑜𝑐𝑢𝑠 𝑖𝑠 𝑜𝑛 𝑒𝑛𝑠𝑢𝑟𝑖𝑛𝑔 𝑎𝑙𝑙 𝑣𝑎𝑙𝑢𝑒𝑠 𝑓𝑟𝑜𝑚 𝑡ℎ𝑒 𝑟𝑖𝑔ℎ𝑡-𝑠𝑖𝑑𝑒 𝑡𝑎𝑏𝑙𝑒 𝑎𝑝𝑝𝑒𝑎𝑟 𝑖𝑛 𝑡ℎ𝑒 𝑟𝑒𝑠𝑢𝑙𝑡𝑠—𝑖𝑑𝑒𝑎𝑙 𝑓𝑜𝑟 𝑖𝑛𝑣𝑒𝑛𝑡𝑜𝑟𝑦 𝑎𝑛𝑑 𝑠𝑢𝑝𝑝𝑙𝑖𝑒𝑟 𝑎𝑢𝑑𝑖𝑡𝑠.
Still, something felt incomplete. Alex wondered if there were orphaned records—customers who never ordered and orders that had missing customer data.
By performing a 𝐅𝐔𝐋𝐋 𝐎𝐔𝐓𝐄𝐑 𝐉𝐎𝐈𝐍, all customers and orders were displayed, whether they matched or not. This exposed data inconsistencies, helping Breez clean their records.
𝐓𝐢𝐩
𝐹𝑈𝐿𝐿 𝑂𝑈𝑇𝐸𝑅 𝐽𝑂𝐼𝑁 𝑖𝑠 𝑡ℎ𝑒 𝑏𝑒𝑠𝑡 𝑐ℎ𝑜𝑖𝑐𝑒 𝑤ℎ𝑒𝑛 𝑦𝑜𝑢 𝑛𝑒𝑒𝑑 𝑒𝑣𝑒𝑟𝑦𝑡ℎ𝑖𝑛𝑔—𝑖𝑑𝑒𝑎𝑙 𝑓𝑜𝑟 𝑑𝑎𝑡𝑎 𝑎𝑢𝑑𝑖𝑡𝑠 𝑎𝑛𝑑 𝑟𝑒𝑐𝑜𝑛𝑐𝑖𝑙𝑖𝑎𝑡𝑖𝑜𝑛.
Finally, Alex needed to suggest personalized product recommendations. Using a 𝐂𝐑𝐎𝐒𝐒 𝐉𝐎𝐈𝐍, they generated all possible combinations of customers and products to create exciting bundle deals.
𝐓𝐢𝐩
𝐶𝑅𝑂𝑆𝑆 𝐽𝑂𝐼𝑁 ℎ𝑒𝑙𝑝𝑠 𝑤ℎ𝑒𝑛 𝑏𝑟𝑎𝑖𝑛𝑠𝑡𝑜𝑟𝑚𝑖𝑛𝑔 𝑝𝑜𝑠𝑠𝑖𝑏𝑖𝑙𝑖𝑡𝑖𝑒𝑠 𝑎𝑛𝑑 𝑐𝑟𝑒𝑎𝑡𝑖𝑛𝑔 𝑛𝑒𝑤 𝑜𝑝𝑝𝑜𝑟𝑡𝑢𝑛𝑖𝑡𝑖𝑒𝑠—𝑔𝑟𝑒𝑎𝑡 𝑓𝑜𝑟 𝑚𝑎𝑟𝑘𝑒𝑡𝑖𝑛𝑔 𝑠𝑡𝑟𝑎𝑡𝑒𝑔𝑖𝑒𝑠.
Armed with these powerful insights, Alex presented the findings to Breez’s CEO. Sales improved, customer engagement soared, and Alex earned the title of “𝐓𝐡𝐞 𝐃𝐚𝐭𝐚 𝐃𝐞𝐭𝐞𝐜𝐭𝐢𝐯𝐞 𝐨𝐟 𝐄𝐤𝐨𝐦.”
𝐓𝐡𝐞 𝐄𝐧𝐝. 😊
Visit this link to learn more: https://soyoolasodunke.hashnode.dev/joins-in-sql
soyoolasodunke.hashnode.dev
20/01/2025
It's easy to find answers to the question: How many categories of data exist in the field of data analytics?
The common response is straightforward. However, over time, I've realized that many data specialists focus on utilizing these categories without fully understanding how data is classified into them—or why these classifications matter.
This newsletter aims to simplify that complexity for you.
We'll explore how different types of data—structured, semi-structured, and unstructured—come to be. You'll discover what makes structured data "structured," how semi-structured data strikes a balance, and why unstructured data remains as it is. More importantly, we'll discuss why these distinctions are crucial for data analytics and business success.
Additionally, we'll uncover a fundamental concept: What sets data apart from information? Understanding this difference is key to making informed decisions and driving value from data. Enjoy learning! 🙂
Structured or not? The 3 Faces of Data You Need to Uncover | Soyoola Sodunke
The concepts of data and information are fundamental across various fields, including computing, data analytics, data science, and knowledge management. While these terms are often used interchangeably, they serve distinct purposes and play unique roles in decision-making and analysis.
19/01/2025
Imagine being able to extract insights from complex data sets, create stunning visualizations, and make informed decisions that drive business results. Imagine having the skills and confidence to tackle any data challenge that comes your way.
Welcome to Pemdas Learn, your go-to destination for learning data analytics tools such as SQL, Power BI, Microsoft Fabric, Azure, Microsoft Excel, and others! My channel is dedicated to empowering beginners and pros alike with the knowledge and skills they need to succeed in the world of data.
Subscribe now and get ready to boost your career prospects with in-demand data skills, increase your productivity and efficiency with streamlined workflows, make better decisions with data-driven insights.
Click the link to subscribe and start your journey to becoming a data master today!