Skip to content

Commit 8c6c315

Browse files
fabclmntboris-koganrenovate[bot]vascoalramosricardodcpereira
authored
docs: update readme with advent of code (#1516)
* Update duplicates_pandas.py (#1427) Fixing Bug Report #1384 Dataset with categorical features causes memory error even on tiny dataset. * chore(actions): update sonarsource/sonarqube-scan-action action to v2.0.1 * chore(actions): update actions/checkout action to v4 * docs: setup new docs with mkdocs (#1418) * chore(actions): update actions/checkout action to v4 * fix: remove the duplicated cardinality threshold under categorical and text settings * fix: fixate matplotlib upper version * docs: change from `zap` to `sparkles` (#1447) Co-authored-by: Fabiana <30911746+fabclmnt@users.noreply.github.com> * fix: template {{ file_name }} error in HTML wrapper (#1380) * Update javascript.html * Update style.html * feat: add density histogram (#1458) * feat: add histogram density option * test: add unit test * fix: discard weights if exceed max_bins * docs: update README.html (#1461) Update url of use cases, main integrations, and common issues. * fix: bug when creating a new report (#1440) * fix: gen wordcloud only for non-empty cols (#1459) * fix: table template ignoring text format (#1462) * fix: table template ignoring text format * fix: timeseries unit test * fix(linting): code formatting --------- Co-authored-by: Azory YData Bot <azory@ydata.ai> * fix: to_category misshandling pd.NA (#1464) * docs: add 📊 for Key features (#1451) See also #1445 (comment) * docs: fix hyperlink - related to package name change (#1457) Co-authored-by: Martin Mokry <martin-kokos@users.noreply.github.com> * chore(deps): increase numpy upper limit (#1467) * chore(deps): increase numpy upper limit * chore(deps): fixate numpy version for spark * chore(deps): fix numba package version, and filter warns (#1468) * chore: fix numba package version, and filter warns * fix: skip isort linter on init * chore(deps): update dependency typeguard to v4 (#1324) * chore(deps): update dependency typeguard to v4 --------- Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com> Co-authored-by: Maciej Bukczynski <maciej@darkhorseanalytics.com> * docs: update docs with advent of code * docs: update links for fabric --------- Co-authored-by: boris-kogan <139680785+boris-kogan@users.noreply.github.com> Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com> Co-authored-by: Vasco Ramos <vasco.ramos@ydata.ai> Co-authored-by: ricardodcpereira <ricardo.pereira@ydata.ai> Co-authored-by: Anselm Hahn <Anselm.Hahn@gmail.com> Co-authored-by: Joge <87136119+jogecodes@users.noreply.github.com> Co-authored-by: Alex Barros <alexbarros@users.noreply.github.com> Co-authored-by: Miriam Seoane Santos <68821478+miriamspsantos@users.noreply.github.com> Co-authored-by: Chris Mahoney <44449504+chrimaho@users.noreply.github.com> Co-authored-by: Azory YData Bot <azory@ydata.ai> Co-authored-by: martin-kokos <4807476+martin-kokos@users.noreply.github.com> Co-authored-by: Martin Mokry <martin-kokos@users.noreply.github.com> Co-authored-by: Maciej Bukczynski <maciej@darkhorseanalytics.com> Co-authored-by: Fabiana Clemente <fabianaclemente@Fabianas-MacBook-Air.local>
1 parent 06b6535 commit 8c6c315

13 files changed

Lines changed: 40 additions & 31 deletions

File tree

.github/workflows/docs.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -64,7 +64,7 @@ jobs:
6464
git config core.autocrlf false
6565
6666
- name: Setup Python
67-
uses: actions/setup-python@v5
67+
uses: actions/setup-python@v4
6868
with:
6969
python-version: "3.10"
7070

.github/workflows/pull-request.yml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -37,7 +37,7 @@ jobs:
3737
git config core.autocrlf false
3838
3939
- name: Set up Python 3.8
40-
uses: actions/setup-python@v5
40+
uses: actions/setup-python@v4
4141
with:
4242
python-version: "3.10"
4343

@@ -89,7 +89,7 @@ jobs:
8989
- uses: actions/checkout@v4
9090

9191
- name: Setup Python
92-
uses: actions/setup-python@v5
92+
uses: actions/setup-python@v4
9393
with:
9494
python-version: "3.10"
9595

.github/workflows/release-deprecated.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -19,7 +19,7 @@ jobs:
1919
- uses: actions/checkout@v4
2020

2121
- name: Setup Python 3.8
22-
uses: actions/setup-python@v5
22+
uses: actions/setup-python@v4
2323
with:
2424
python-version: "3.8"
2525

.github/workflows/release.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -20,7 +20,7 @@ jobs:
2020
run: echo "value=${GITHUB_REF#refs/*/}" >> $GITHUB_OUTPUT
2121

2222
- name: Setup Python 3.10
23-
uses: actions/setup-python@v5
23+
uses: actions/setup-python@v4
2424
with:
2525
python-version: "3.10"
2626

.github/workflows/tests.yml

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -53,7 +53,7 @@ jobs:
5353
- uses: actions/checkout@v4
5454

5555
- name: Setup python
56-
uses: actions/setup-python@v5
56+
uses: actions/setup-python@v4
5757
with:
5858
python-version: ${{ matrix.python-version }}
5959
architecture: x64
@@ -101,7 +101,7 @@ jobs:
101101
- uses: actions/checkout@v4
102102

103103
- name: Setup python
104-
uses: actions/setup-python@v5
104+
uses: actions/setup-python@v4
105105
with:
106106
python-version: ${{ matrix.python-version }}
107107
architecture: x64
@@ -185,7 +185,7 @@ jobs:
185185
steps:
186186
- uses: actions/checkout@v4
187187
- name: Setup python
188-
uses: actions/setup-python@v5
188+
uses: actions/setup-python@v4
189189
with:
190190
python-version: ${{ matrix.python-version }}
191191
architecture: x64

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -30,7 +30,7 @@
3030
The package outputs a simple and digested analysis of a dataset, including **time-series** and **text**.
3131

3232
> **Looking for a scalable solution that can fully integrate with your database systems?**<br>
33-
> Leverage YData Fabric Data Catalog to connect to different databases and storages (Oracle, snowflake, PostGreSQL, GCS, S3, etc.) and leverage an interactive and guided profiling experience in Fabric. Check out the [Community Version](https://ydata.ai/ydata-fabric-free-trial).
33+
> Leverage YData Fabric Data Catalog to connect to different databases and storages (Oracle, snowflake, PostGreSQL, GCS, S3, etc.) and leverage an interactive and guided profiling experience in Fabric. Check out the [Community Version](http://ydata.ai/register?utm_source=ydata-profiling&utm_medium=documentation&utm_campaign=YData%20Fabric%20Community).
3434
3535
## ▶️ Quickstart
3636

docs/advanced_settings/collaborative_data_profiling.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -56,4 +56,4 @@ users and per project. YData Fabric Data Catalog helps in maintinaing
5656
regulatory compliance by identifying any sensitive data.
5757

5858
Try today the Catalog experience in with [Fabric Community
59-
version](https://ydata.ai/ydata-fabric-free-trial)!
59+
version](http://ydata.ai/register?utm_source=ydata-profiling&utm_medium=documentation&utm_campaign=YData%20Fabric%20Community)!

docs/features/big_data.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@ computation time of the profiling becomes a bottleneck,
1313
!!! info "Scale in a fully managed system"
1414

1515
Looking for an fully managed system that is able to scale the profiling
16-
for large datasets? [Sign up Fabric](https://ydata.ai/ydata-fabric-free-trial)
16+
for large datasets? [Sign up Fabric](http://ydata.ai/register?utm_source=ydata-profiling&utm_medium=documentation&utm_campaign=YData%20Fabric%20Community)
1717
community for distributed data profiling.
1818

1919
## Pyspark

docs/features/collaborative_data_profiling.md

Lines changed: 9 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,9 +1,9 @@
1-
# Data quality Profiling with a Collaborative experience
1+
# Data Catalog - A collaborative experience to profile datasets & relational databases
22

3-
!!! note
3+
!!! note "Data Catalog with data quality profiling"
44

5-
[Sign-up Fabric community](https://ydata.ai/ydata-fabric-free-trial) to try the **data catalog**
6-
and **collaborative** experience for data profiling at scale!
5+
[Sign-up Fabric community](http://ydata.ai/register?utm_source=ydata-profiling&utm_medium=documentation&utm_campaign=YData%20Fabric%20Community) to try the **data catalog**
6+
and **collaborative** experience for datasets and database profiling at scale!
77

88
[YData Fabric](https://ydata.ai/products/fabric) is a Data-Centric AI
99
development platform. YData Fabric provides all capabilities of
@@ -42,6 +42,10 @@ An interactive experience that allows to drill-down in a comprehensive data prof
4242
and relationship analysis, providing deep insights into data structure,
4343
distributions and interactions for improved data preparation.
4444

45+
<p style="text-align:center;">
46+
<iframe width="560" height="315" src="https://www.youtube.com/embed/9EupCg5YQLE?si=Tuu68p6sj_RzxTBn&amp;clip=UgkxGNvIAcxUiqBSepTZzP2-4evffzjU7aHX&amp;clipt=EJbiBxinoAg" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen></iframe>
47+
</p>
48+
4549
### Data quality indexes
4650

4751
Access and navigate indicators and data quality statistics, such as completeness, uniqueness
@@ -61,4 +65,4 @@ users and per project. YData Fabric Data Catalog helps in maintaining
6165
regulatory compliance by identifying any sensitive data.
6266

6367
Try today the Catalog experience in with [Fabric Community
64-
version](https://ydata.ai/ydata-fabric-free-trial)!
68+
version](http://ydata.ai/register?utm_source=ydata-profiling&utm_medium=documentation&utm_campaign=YData%20Fabric%20Community)!

docs/getting-started/concepts.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -99,7 +99,7 @@ This section provides a comprehensive profiling over the potential dataset outli
9999
based on observed variance.
100100
The identification of outliers allows the data analyst or scientist to assess whether they are genuine data anomalies or erroneous entries, allowing for informed decisions on whether to retain, transform, or exclude these points in further analyses.
101101

102-
Feature limited to user of the [cloud hosted solution](https://ydata.ai/ydata-fabric-free-trial).
102+
Feature limited to user of the [cloud hosted solution](http://ydata.ai/register?utm_source=ydata-profiling&utm_medium=documentation&utm_campaign=YData%20Fabric%20Community).
103103

104104
## Preview data
105105
For a quick overview of the data, ydata-profiling provides the following sections that can be easily configure by the user:

0 commit comments

Comments
 (0)