Commit 7a7a98c

removed average account balance by type of credit card
1 parent 925a38a commit 7a7a98c

2 files changed

Lines changed: 0 additions & 14 deletions

_pages/syndiffix-mostlyai-ctgan.md

Lines changed: 0 additions & 14 deletions
@@ -12,20 +12,6 @@ For this comparison, we use the [Czech banking dataset](https://data.world/lpetr
 
 The code for some of the following examples can be found [here](https://github.com/diffix/syndiffix/blob/main/docs/time-series.ipynb).
 
-## Average account balance by type of credit card
-
-892 of the 4500 bank accounts have credit cards. There are three classes of card, 'junior', 'classic', and 'gold'. Here we ask the question, "Does the average account balance differ based on the credit card class?"
-
-The following graphs are percentile boxplots (100th, 75th, 50th, 25th, and 0th percentiles, plus outliers) of the average account balance for each account with a credit card. The two plots show the same data, but grouped by synthesis method on the left, and by card type on the right.
-
-{% include image.html src="/assets/img/trans_account_card_clients.avg_balance.card_type.png" alt="SynDiffix usage style" max_width="650px" %}
-
-The SynDiffix synthetic data was produced by first computing the average balance per account from the original data to produce a two-column table consisting of the `avg_balance` and the `card_type`, with one row per account (i.e. not time-series data). This two-column table was then synthesized.
-
-From the original data, we see that indeed gold credit card holders have higher average account balances than classic card holders, which in turn are a little higher than junior card holders. The data for SynDiffix is very close to the original data, and accurately shows the same trend.
-
-The data for Mostly AI and CTGAN are less accurate. Both show a slightly higher median balance for gold cards, but fail to show a lower balance for junior cards. Both Mostly AI and CTGAN show a much wider range of average balances, including negative average balances where none should exist.
-
 ## Distribution of deposit and withdrawal amounts
 
 There are five types of transaction operations. One of them represents deposits (VKLAD), while the other four represent different types of withdrawals (at least as near as we can tell from the Czech translations).

assets/img/compare-link.png

7.87 KB