- Introduction — What’s Tablib?
- Working with Datasets
- Importing Data
- Exporting Data
- Dynamic Columns
- Formatters
- Wrapping Up
For a few years I’ve been working with tools like Pandas and PySpark in Python for data ingestion, data processing, and data exporting. These tools are great for complex data transformations and large data sizes (Pandas, as long as the data fits in memory). However, I’ve often used these tools when the following conditions apply:
- The data size is relatively small. Think well under 100,000 rows of data.
- Performance is not an issue at all. Think of a one-off job, or a job that repeats at midnight every night where I don’t care whether it takes 20 seconds or 5 minutes.
- There are no complex transformations needed. Think of simply importing 20 JSON files with the same format, stacking them on top of each other, and then exporting the result as a CSV file.