From d8d46f612550872c488e8b19e0e195776a3052c6 Mon Sep 17 00:00:00 2001 From: Huan Chen Date: Thu, 27 Mar 2025 22:20:10 +0000 Subject: [PATCH 1/9] docs: add warning for bigframes 2.0 --- README.rst | 14 ++++++++++++++ 1 file changed, 14 insertions(+) diff --git a/README.rst b/README.rst index 185c50c14a..cb6cfab94f 100644 --- a/README.rst +++ b/README.rst @@ -3,6 +3,20 @@ BigQuery DataFrames |GA| |pypi| |versions| +.. raw:: html + +
+

⚠️ Breaking Changes in BigQuery DataFrames v2.0

+

Version 2.0 introduces breaking changes for improved security and performance. Key default behaviors have changed:

+ +

Important: If you are not ready to adapt to these changes, please pin your dependency to a version less than 2.0 (e.g., bigframes==1.38.0) to avoid disruption.

+

Please review the updated introduction guide for more details on v2.0 behavior.

+
+ BigQuery DataFrames provides a Pythonic DataFrame and machine learning (ML) API powered by the BigQuery engine. From 166f3807dadaac5adb7f15955d18c0cfb2c561ee Mon Sep 17 00:00:00 2001 From: Huan Chen Date: Thu, 27 Mar 2025 22:55:01 +0000 Subject: [PATCH 2/9] style update --- README.rst | 7 ++++--- 1 file changed, 4 insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index cb6cfab94f..97e12cebd4 100644 --- a/README.rst +++ b/README.rst @@ -5,16 +5,17 @@ BigQuery DataFrames .. raw:: html -
+

⚠️ Breaking Changes in BigQuery DataFrames v2.0

Version 2.0 introduces breaking changes for improved security and performance. Key default behaviors have changed:

  • Large Results (>10GB): Methods like to_pandas() will now fail by default due to stricter limits being enforced.
  • -
  • Remote Function Security: The library no longer defaults to using the Compute Engine service account, and network ingress now defaults to "internal-only", impacting default deployment permissions and connectivity.
  • +
  • Remote Function Security: The library no longer defaults to using the Compute Engine service account, and network ingress now defaults to "internal-only".
  • Endpoint Connections: Automatic fallback to locational endpoints in certain regions is removed. Connections that previously relied on this fallback will no longer establish successfully.

Important: If you are not ready to adapt to these changes, please pin your dependency to a version less than 2.0 (e.g., bigframes==1.38.0) to avoid disruption.

-

Please review the updated introduction guide for more details on v2.0 behavior.

+

Please review the updated introduction guide for more details on v2.0 behavior.

BigQuery DataFrames provides a Pythonic DataFrame and machine learning (ML) API From 36838628acc96012d48f84f71c2bc78866ce034d Mon Sep 17 00:00:00 2001 From: Huan Chen Date: Thu, 27 Mar 2025 23:12:36 +0000 Subject: [PATCH 3/9] update content --- README.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index 97e12cebd4..c469e19352 100644 --- a/README.rst +++ b/README.rst @@ -10,9 +10,9 @@ BigQuery DataFrames

⚠️ Breaking Changes in BigQuery DataFrames v2.0

Version 2.0 introduces breaking changes for improved security and performance. Key default behaviors have changed:

    -
  • Large Results (>10GB): Methods like to_pandas() will now fail by default due to stricter limits being enforced.
  • +
  • Large Results (>10GB): The default value for allow_large_results has changed to False. Methods like to_pandas() will now fail if the query result's compressed data size exceeds 10GB, unless large results are explicitly permitted.
  • Remote Function Security: The library no longer defaults to using the Compute Engine service account, and network ingress now defaults to "internal-only".
  • -
  • Endpoint Connections: Automatic fallback to locational endpoints in certain regions is removed. Connections that previously relied on this fallback will no longer establish successfully.
  • +
  • Endpoint Connections: Automatic fallback to locational endpoints in certain regions is removed.

Important: If you are not ready to adapt to these changes, please pin your dependency to a version less than 2.0 (e.g., bigframes==1.38.0) to avoid disruption.

Please review the updated introduction guide for more details on v2.0 behavior.

From d2b145df65aa90c9116dd81e95b6fca460648483 Mon Sep 17 00:00:00 2001 From: Huan Chen Date: Thu, 27 Mar 2025 23:18:31 +0000 Subject: [PATCH 4/9] update text --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index c469e19352..00cbd27973 100644 --- a/README.rst +++ b/README.rst @@ -15,7 +15,7 @@ BigQuery DataFrames
  • Endpoint Connections: Automatic fallback to locational endpoints in certain regions is removed.
  • Important: If you are not ready to adapt to these changes, please pin your dependency to a version less than 2.0 (e.g., bigframes==1.38.0) to avoid disruption.

    -

    Please review the updated introduction guide for more details on v2.0 behavior.

    +

    To learn about these changes and how to migrate to version 2.0, see: updated introduction guide.

    BigQuery DataFrames provides a Pythonic DataFrame and machine learning (ML) API From 521d01bb949b547805bd94907da43a483005d0d0 Mon Sep 17 00:00:00 2001 From: Huan Chen Date: Fri, 28 Mar 2025 02:00:43 +0000 Subject: [PATCH 5/9] upadte based on comment --- README.rst | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index 00cbd27973..301efbfa6d 100644 --- a/README.rst +++ b/README.rst @@ -11,10 +11,11 @@ BigQuery DataFrames

    Version 2.0 introduces breaking changes for improved security and performance. Key default behaviors have changed:

    • Large Results (>10GB): The default value for allow_large_results has changed to False. Methods like to_pandas() will now fail if the query result's compressed data size exceeds 10GB, unless large results are explicitly permitted.
    • -
    • Remote Function Security: The library no longer defaults to using the Compute Engine service account, and network ingress now defaults to "internal-only".
    • +
    • Remote Function Security:The library no longer automatically lets the Compute Engine default service account become the identity of the Cloud Run functions. If that is desired, it has to be indicated by passing cloud_function_service_account="default". And network ingress now defaults to "internal-only".
    • +
    • @remote_function Argument Passing: Arguments to decorated functions must now be passed using keyword syntax, as positional arguments are no longer supported.
    • Endpoint Connections: Automatic fallback to locational endpoints in certain regions is removed.
    -

    Important: If you are not ready to adapt to these changes, please pin your dependency to a version less than 2.0 (e.g., bigframes==1.38.0) to avoid disruption.

    +

    Important: If you are not ready to adapt to these changes, please pin your dependency to a version less than 2.0 (e.g., bigframes==1.42.0) to avoid disruption.

    To learn about these changes and how to migrate to version 2.0, see: updated introduction guide.

    From d12a7496a2422dc40742bb73ac127f20ba11783d Mon Sep 17 00:00:00 2001 From: Huan Chen Date: Fri, 28 Mar 2025 02:07:37 +0000 Subject: [PATCH 6/9] update wording --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 301efbfa6d..23f399c710 100644 --- a/README.rst +++ b/README.rst @@ -12,7 +12,7 @@ BigQuery DataFrames
    • Large Results (>10GB): The default value for allow_large_results has changed to False. Methods like to_pandas() will now fail if the query result's compressed data size exceeds 10GB, unless large results are explicitly permitted.
    • Remote Function Security:The library no longer automatically lets the Compute Engine default service account become the identity of the Cloud Run functions. If that is desired, it has to be indicated by passing cloud_function_service_account="default". And network ingress now defaults to "internal-only".
    • -
    • @remote_function Argument Passing: Arguments to decorated functions must now be passed using keyword syntax, as positional arguments are no longer supported.
    • +
    • @remote_function Argument Passing: Arguments to remote_function must now be passed using keyword syntax, as positional arguments are no longer supported.
    • Endpoint Connections: Automatic fallback to locational endpoints in certain regions is removed.

    Important: If you are not ready to adapt to these changes, please pin your dependency to a version less than 2.0 (e.g., bigframes==1.42.0) to avoid disruption.

    From 27e4bb847cbe52eb8fb78d420e44009b61a238ab Mon Sep 17 00:00:00 2001 From: Huan Chen Date: Fri, 28 Mar 2025 02:19:42 +0000 Subject: [PATCH 7/9] move to lower --- README.rst | 18 +++++++++--------- 1 file changed, 9 insertions(+), 9 deletions(-) diff --git a/README.rst b/README.rst index 23f399c710..00e09eff3a 100644 --- a/README.rst +++ b/README.rst @@ -3,6 +3,15 @@ BigQuery DataFrames |GA| |pypi| |versions| +BigQuery DataFrames provides a Pythonic DataFrame and machine learning (ML) API +powered by the BigQuery engine. + +* ``bigframes.pandas`` provides a pandas-compatible API for analytics. +* ``bigframes.ml`` provides a scikit-learn-like API for ML. + +BigQuery DataFrames is an open-source package. You can run +``pip install --upgrade bigframes`` to install the latest version. + .. raw:: html
    To learn about these changes and how to migrate to version 2.0, see: updated introduction guide.

    -BigQuery DataFrames provides a Pythonic DataFrame and machine learning (ML) API -powered by the BigQuery engine. - -* ``bigframes.pandas`` provides a pandas-compatible API for analytics. -* ``bigframes.ml`` provides a scikit-learn-like API for ML. - -BigQuery DataFrames is an open-source package. You can run -``pip install --upgrade bigframes`` to install the latest version. - .. |GA| image:: https://img.shields.io/badge/support-GA-gold.svg :target: https://github.com/googleapis/google-cloud-python/blob/main/README.rst#general-availability .. |pypi| image:: https://img.shields.io/pypi/v/bigframes.svg From 4d49cb4effa3e7513fec55b95b904f71853365e5 Mon Sep 17 00:00:00 2001 From: Huan Chen Date: Fri, 28 Mar 2025 02:38:24 +0000 Subject: [PATCH 8/9] Add llm change --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 00e09eff3a..acc7e96a6c 100644 --- a/README.rst +++ b/README.rst @@ -23,6 +23,7 @@ BigQuery DataFrames is an open-source package. You can run
  • Remote Function Security:The library no longer automatically lets the Compute Engine default service account become the identity of the Cloud Run functions. If that is desired, it has to be indicated by passing cloud_function_service_account="default". And network ingress now defaults to "internal-only".
  • @remote_function Argument Passing: Arguments to remote_function must now be passed using keyword syntax, as positional arguments are no longer supported.
  • Endpoint Connections: Automatic fallback to locational endpoints in certain regions is removed.
  • +
  • LLM Changes (Shift to Gemini): Integrations now default to gemini-2.0-flash-001. Support for PaLM2 models has been removed; migrate any PaLM2 usage to Gemini.
  • Important: If you are not ready to adapt to these changes, please pin your dependency to a version less than 2.0 (e.g., bigframes==1.42.0) to avoid disruption.

    To learn about these changes and how to migrate to version 2.0, see: updated introduction guide.

    From 559213f2d3f2e5aee414688dfb444d4c93992706 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Tim=20Swe=C3=B1a=20=28Swast=29?= Date: Fri, 28 Mar 2025 11:42:32 -0500 Subject: [PATCH 9/9] Update README.rst --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index acc7e96a6c..2fd558119a 100644 --- a/README.rst +++ b/README.rst @@ -21,7 +21,7 @@ BigQuery DataFrames is an open-source package. You can run
    • Large Results (>10GB): The default value for allow_large_results has changed to False. Methods like to_pandas() will now fail if the query result's compressed data size exceeds 10GB, unless large results are explicitly permitted.
    • Remote Function Security:The library no longer automatically lets the Compute Engine default service account become the identity of the Cloud Run functions. If that is desired, it has to be indicated by passing cloud_function_service_account="default". And network ingress now defaults to "internal-only".
    • -
    • @remote_function Argument Passing: Arguments to remote_function must now be passed using keyword syntax, as positional arguments are no longer supported.
    • +
    • @remote_function Argument Passing: Arguments other than input_types, output_type, and dataset to remote_function must now be passed using keyword syntax, as positional arguments are no longer supported.
    • Endpoint Connections: Automatic fallback to locational endpoints in certain regions is removed.
    • LLM Changes (Shift to Gemini): Integrations now default to gemini-2.0-flash-001. Support for PaLM2 models has been removed; migrate any PaLM2 usage to Gemini.