Introducing Clones An efficient way to make copies of large datasets for testing, sharing and reproducing ML experiments We are…
Continue Readingtable
Enabling Spark SQL DDL and DML in Delta Lake on Apache Spark 3.0
Last week, we had a fun Delta Lake 0. 7. 0 + Apache Spark 3. 0 AMA where Burak Yavuz,…
Continue ReadingHow to Extract tabular data from PDF document using Camelot in Python
Introduction PDF or Portable Document File format is one of the most common file formats in today’s time. It is…
Continue Reading10+ Simple Yet Powerful Excel Tricks for Data Analysis
Overview Microsoft Excel is one of the most widely used tools for data analysis Learn the essential Excel functions used…
Continue ReadingHere’s How to Build a Pivot Table using Pandas in Python
Pivot tables – the Swiss Army Knife of data analysis I love how quickly I can analyze data using pivot…
Continue ReadingFaster ClickHouse Imports
'; The above took 20 minutes and 23 seconds to complete. Ill launch ClickHouses Client, create a table pointing to…
Continue ReadingDiving Into Delta Lake: Unpacking The Transaction Log
Breaking Down Transactions Into Atomic Commits Whenever a user performs an operation to modify a table (such as an INSERT,…
Continue ReadingDatabase Normalization Explained
Database Normalization ExplainedLearn about database normalization by designing and modifying an example database schema!Lorraine LiBlockedUnblockFollowFollowingJul 2Normalization is a technique for organizing…
Continue ReadingRelational Database Management (RDBMS) Basic for Data Professionals
Definitive Guide of Data ProfessionalsRelational Database Management (RDBMS) Basic for Data ProfessionalsBasic RDBMS with Python SQLite3 and SQLAlchemyVincent TatanBlockedUnblockFollowFollowingJun 23Source…
Continue ReadingRAM Rekt! EOS Storage Pitfalls
EOS Storage PitfallsKeith MukaiBlockedUnblockFollowFollowingMay 23A case study on disastrous smart contract data design… and how to fix it. This discussion will…
Continue ReadingLaravel: A single table for each Model type
Or you should repeat yourself a little?Italo BaezaBlockedUnblockFollowFollowingMay 13Photo by Keagan Henman on UnsplashSometimes we have in our application two (or more) Models…
Continue ReadingQuerying the Premier League using Python and SQL Combined
Querying the Premier League using Python and SQL CombinedFrom Excel to MySQL via Python, and then back to ExcelStephen FordhamBlockedUnblockFollowFollowingMay 9It may…
Continue ReadingQuerying the Premier League
Querying the Premier League-Using MySQL with PythonStephen FordhamBlockedUnblockFollowFollowingMay 9It may be useful sometimes to convert an Excel sheet into a MySQL database.…
Continue ReadingEcto Changesets — put, cast, embeds and assocs. Remember the difference once and for all!
Let’s see if we can get to the bottom of the difference between put_embed, put_assoc, cast_embed and cast_assoc once and…
Continue ReadingUpgrade your Python skills: Examining the Dictionary
Oh, And a set too. Huh?Dictionaries and sets in Python are implemented using a hash table. It may sound daunting…
Continue ReadingConsistent Hashing
Consistent HashingVivek Kumar SinghBlockedUnblockFollowFollowingMar 21Consistent hashing idea was introduced in paper Consistent Hashing and Random Trees: Distributed Caching Protocols for…
Continue ReadingBring Dynamo to the Data Science Party
Bring Dynamo to the Data Science PartyAd Hoc Querying of Dynamo Tables via Athena in a Jupyter NotebookAlex QuinteroBlockedUnblockFollowFollowingMar 29Dynamo is a…
Continue ReadingAnalysis of Developers Trends with JavaScript Pivot Table and Charting Library
Analysis of Developers Trends with JavaScript Pivot Table and Charting LibraryVeronika RovnikBlockedUnblockFollowFollowingMar 28Hi, dev community!Today I’d like to share my experience…
Continue ReadingWeb Scraping NBA Stats
Now we will use urlopen that we imported from the urllib. request library, then create a BeautifulSoup object by passing…
Continue ReadingHow to Build a Reporting Dashboard using Dash and Plotly
How to Build a Reporting Dashboard using Dash and PlotlyDavid ComfortBlockedUnblockFollowFollowingMar 11In this blog post, I will provide a step-by-step tutorial…
Continue ReadingThe Chi Square Statistic (p.1)
Try it. Notice that the new x² value is 4. 125 and this value exceeds the table value of 3.…
Continue ReadingAWS Infrastructure as Code with CDK
Wishing you could move away from JSON or YAML and instead implement your infrastructure with an imperative programming language?Released for…
Continue ReadingTerraform: Building out an Application Environment in AWS
Terraform: Building out an Application Environment in AWSGarrett SweeneyBlockedUnblockFollowFollowingMar 3Everyone loves the cloudToday we’re going to utilize Terraform to build out a…
Continue ReadingGuide to simple Rails features
Guide to simple Rails featuresMicah ShuteBlockedUnblockFollowFollowingFeb 20I recently made my first Ruby on Rails app — here are some resources and strategies I…
Continue Reading