Python Parquet and Arrow: Using PyArrow with Pandas

Parquet and Arrow are two Apache projects available in Python via the PyArrow library. Parquet is an efficient, compressed, column-oriented storage format for arrays and tables of data. Arrow is an in-memory columnar format for data analysis that is designed to be used across different languages. It currently boasts supported libraries for several important languages, … Read more

CodeSolid Newsletter 12/13/2022

Hello, CodeSolid Subscriber, I hope you’re doing well. It’s time for another review of what’s new on CodeSolid and around selected parts of the rest of the Python Interwebs. New On CodeSolid On CodeSolid, our new author, Bashir Alam, has worked very hard on our Pandas Series, with two recent articles on Pandas: How to … Read more

Python Data Analysis Starter Project

colored-scatter-plot

Installing the Tools for the Pandas Series This article contains the instructions for installing the Python modules that you’ll need to run the code in our Pandas Series. If you need to do this, you can skip ahead to the section “Using the Project,” or feel free to read the next section for more background. … Read more

CodeSolid Newsletter, 11/29/2022

Dear CodeSolid Subscriber, Hello again.  US readers, I hope you had an awesome Thanksgiving.  Oh, let’s not be exclusive about this:  I hope you had an awesome Thanksgiving no matter where you are.  Thanksgiving was Thursday. As many of you know, I’ve been wrestling a bit with the frequency of this newsletter.  Once per week … Read more

Clicky