docs(rfc): RFC 0004 — DataFrame und Blob (Content-Layer vs. Multi-Format)#73
Merged
Conversation
…mat) Analyse (Status: Proposed), ob DataFrame auf Blob erben soll. Befund: nein — das Blob-Modell (ein fixer Content-Blob) passt schlecht auf die lebende, in viele Formate kodierbare pandas-Tabelle. Optionen A (Vererbung, verworfen: to_dict-Bruch, Sync-Problem, Hash-Mehrdeutigkeit), B (Integritäts-Mixin), C (Komposition DataFrame.as_blob(fmt)), D (Status quo). Empfehlung: Komposition (C) + optional Mixin (B), keine Vererbung. Klärt die "Grundlage" als gemeinsamen Layer statt zwingender Basisklasse. In die mkdocs-RFC-Navigation aufgenommen.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Analyse-RFC (Status: Proposed, kein Code) als Abschluss von RFC 0003: soll
DataFrameaufBloberben?Befund
Nein. Das
Blob-Modell (ein fixer Content-Blob) passt schlecht auf dasDataFrame-Modell (eine lebende pandas-Tabelle, on-demand in viele Formate kodierbar — Parquet/Arrow/Feather/CSV/dict/JSON-LD/HDF5/Data Package).Optionen
DataFrame(Blob)— verworfen: bricht dasto_dict-Layout (data.parquet_bytes/column_metadatavs.data.content), Sync-Problem (lebende df), mehrdeutige Hash-Semantik.verify-Layer fürBlobundDataFrame, additiv.DataFrame.as_blob(fmt="parquet") -> Blob— Tabelle bei Bedarf als Blob in jedem Format ausgeben; null Bruchrisiko.Empfehlung
Komposition (C) als primärer Weg, optional ergänzt um den Mixin (B); keine Vererbung. Klärt die „Grundlage aller Datenformate" als gemeinsamen Layer (single-content erbt, mehrformatig komponiert), nicht als zwingende Basisklasse. Inkl. Testplan + offene Punkte (Prüfsummen-Determinismus bei Parquet, Default-Format).
In die mkdocs-RFC-Navigation aufgenommen;
mkdocs build --strictexit 0.