Hey data engineering twitter, what's the current state of the art in #pandas-to-postgres or Arrow-to-postgres? Redshift supports COPY directly on parquet files but regular old postgres doesn't. Is there a way to avoid the CSV format ser/de?
-
-
Hey, quick question, what is going on here in this whole if statement? Isn't returning col.astype('category') enough? I can see it being something with the types of the categories but then how does len(cats) help you decide anything? https://github.com/spitz-dan-l/postgres-binary-parser/blob/64f54da0c6e6821467a217bfc3b0726f10b12173/postgres_binary_parser/schema.py#L382 …
-
The intention was: turn it categorical, and force the categories to be strings. reduce_categories was written before remove_unused_categories was added to pandas, so it was intended to make the string conversion as efficient as possible. Probably not relevant to your use case
- 11 more replies
New conversation -
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.