Databricks Runtime 8.0 for ML (Unsupported)

Databricks released this image in March 2021.

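Databricks Runtime 8.0 for Machine Learning provides a ready-to-go environment for machine learning and data science based on Databricks Runtime 8.0 (Unsupported). Databricks Runtime ML contains many popular machine learning libraries, including TensorFlow, PyTorch, and XGBoost. It also supports distributed deep learning training using Horovod.

For more information, including instructions for creating a Databricks Runtime ML cluster, see Databricks Runtime for Machine Learning.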

New features and major changes

Databricks Runtime 8.0 ML is built on top of Databricks Runtime 8.0. For information on what's new in Databricks Runtime 8.0, including Apache Spark MLlib and SparkR, see the Databricks Runtime 8.0 (Unsupported) release notes.

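Conda channel configuration

In September 2020, Anaconda Inc. updated its terms of service for anaconda.org channels. Under the new terms of service, you may require a commercial license if you rely on Anaconda's packaging and distribution. See the Anaconda Commercial Edition FAQ for more information. As a result of this change, the default channel configuration for the Conda package manager has been removed in Databricks Runtime ML 8.0. When you use the %conda command to install or update packages, you must specify a channel. Your use of any Anaconda channels is governed by their terms of service.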
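As an illustration of specifying a channel explicitly, the sketch below assembles the text of a %conda install cell that carries conda's standard -c channel option. The helper name conda_install_command and the conda-forge channel are assumptions made for this example; they are not part of Databricks Runtime ML, and any channel you pick remains subject to its own terms.

```python
from typing import Sequence


def conda_install_command(packages: Sequence[str], channel: str) -> str:
    """Build the text of a %conda install cell with an explicit channel.

    Hypothetical helper for illustration only; it just formats a string.
    """
    if not channel:
        # No default channels are configured, so an empty channel is an error.
        raise ValueError("a channel is required")
    if not packages:
        raise ValueError("at least one package is required")
    return f"%conda install -c {channel} " + " ".join(packages)


# The channel name below is an assumption; substitute the one you intend to use.
print(conda_install_command(["boto3=1.16.7"], channel="conda-forge"))
# prints: %conda install -c conda-forge boto3=1.16.7
```

The resulting line would be pasted into its own notebook cell; the helper only guards against forgetting the channel.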

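Major changes to the Databricks Runtime ML Python environment

See Databricks Runtime 8.0 (Unsupported) for the major changes to the Databricks Runtime Python environment. For a full list of installed Python packages and their versions, see Python libraries.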

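Major changes to the environment

  • The default Conda channels have been removed.

  • The system default Python version has been updated from 3.7.6 to 3.8.5.

  • TensorFlow 1.X is no longer supported.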

Python packages upgraded

  • tensorboard 2.3.1 -> 2.4.1

  • tensorflow 2.3.1 -> 2.4.0

  • matplotlib 3.1.3 -> 3.2.2

  • joblib 0.14.1 -> 0.17.0

  • petastorm 0.9.7 -> 0.9.8

  • cloudpickle 1.4.1 -> 1.6.0

  • nltk 3.4.5 -> 3.5

  • Packages in the Anaconda distribution have been upgraded from 2020.02 to 2020.11
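One way to confirm that a cluster actually carries the upgraded versions is to compare the installed distributions against the list above. The following is a minimal sketch using the standard-library importlib.metadata module (available from Python 3.8); the EXPECTED dictionary holds a subset of the versions listed above, and the function names are illustrative, not a Databricks API.

```python
from importlib import metadata
from typing import Dict, Optional

# Subset of the upgraded versions listed above.
EXPECTED = {
    "tensorboard": "2.4.1",
    "matplotlib": "3.2.2",
    "joblib": "0.17.0",
    "petastorm": "0.9.8",
    "cloudpickle": "1.6.0",
    "nltk": "3.5",
}


def installed_version(name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if absent."""
    try:
        return metadata.version(name)
    except metadata.PackageNotFoundError:
        return None


def version_mismatches(expected: Dict[str, str]) -> Dict[str, Optional[str]]:
    """Map each package whose installed version differs to what was found."""
    found = {name: installed_version(name) for name in expected}
    return {name: ver for name, ver in found.items() if ver != expected[name]}


if __name__ == "__main__":
    for name, ver in version_mismatches(EXPECTED).items():
        print(f"{name}: expected {EXPECTED[name]}, found {ver}")
```

Outside a Databricks Runtime 8.0 ML cluster this will simply report whatever differs in the local environment, with None for packages that are not installed.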

Python packages added

  • shap: 0.37.0

Python packages removed

  • gorilla

  • patch

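System environment

The system environment in Databricks Runtime 8.0 ML differs from that of Databricks Runtime 8.0.

The following sections list the libraries included in Databricks Runtime 8.0 ML that differ from those included in Databricks Runtime 8.0.

Python libraries

Databricks Runtime 8.0 ML uses Conda for Python package management and includes many popular ML packages.

In addition to the packages specified in the Conda environments in the following sections, Databricks Runtime 8.0 ML also includes the following packages: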

  • hyperopt 0.2.5.db1

  • sparkdl 2.1.0.db4

Python libraries on CPU clusters

name: databricks-ml
channels:
  - pytorch
  - defaults
dependencies:
  - _libgcc_mutex=0.1=main
  - absl-py=0.11.0=pyhd3eb1b0_1
  - aiohttp=3.6.3=py38h7b6447c_0
  - asn1crypto=1.4.0=py_0
  - astor=0.8.1=py38_0
  - async-timeout=3.0.1=py38_0
  - attrs=20.3.0=pyhd3eb1b0_0
  - backcall=0.2.0=py_0
  - bcrypt=3.2.0=py38h7b6447c_0
  - blas=1.0=mkl
  - blinker=1.4=py38_0
  - boto3=1.16.7=pyhd3eb1b0_0
  - botocore=1.19.7=pyhd3eb1b0_0
  - brotlipy=0.7.0=py38h27cfd23_1003
  - c-ares=1.17.1=h27cfd23_0
  - ca-certificates=2021.4.13=h06a4308_1 # (updated from 2021.1.19 in May 26, 2021 maintenance update)
  - cachetools=4.2.0=pyhd3eb1b0_0
  - certifi=2020.12.5=py38h06a4308_0
  - cffi=1.14.3=py38h261ae71_2
  - chardet=3.0.4=py38h06a4308_1003
  - click=7.1.2=pyhd3eb1b0_0
  - cloudpickle=1.6.0=py_0
  - configparser=5.0.1=py_0
  - cpuonly=1.0=0
  - cryptography=3.1.1=py38h1ba5d50_0
  - cycler=0.10.0=py38_0
  - cython=0.29.21=py38h2531618_0
  - decorator # version illegible in source (build py_0)
  - dill=0.3.2=py_0
  - docutils=0.15.2=py38_0
  - entrypoints=0.3=py38_0
  - flask=1.1.2=py_0
  - freetype=2.10.4=h5ab3b9f_0
  - future=0.18.2=py38_1
  - gitdb # version illegible in source (build py_0)
  - gitpython=3.1.11=pyhd3eb1b0_1
  - google-auth=1.22.1=py_0
  - google-auth-oauthlib=0.4.2=pyhd3eb1b0_2
  - google-pasta=0.2.0=py_0
  - gunicorn=20.0.4=py38_0
  - h5py=2.10.0=py38h7918eee_0
  - hdf5=1.10.4=hb1b8bf9_0
  - icu=58.2=he6710b0_3
  - idna=2.10=pyhd3eb1b0_0
  - importlib-metadata=2.0.0=py_1
  - intel-openmp=2019.4=243
  - ipykernel=5.3.4=py38h5ca1d4c_0
  - ipython=7.19.0=py38hb070fc8_1
  - ipython_genutils=0.2.0=pyhd3eb1b0_1
  - isodate=0.6.0=py_1
  - itsdangerous=1.1.0=py_0
  - jedi=0.17.2=py38h06a4308_1
  - jinja2=2.11.2=pyhd3eb1b0_0
  - jmespath=0.10.0=py_0
  - joblib=0.17.0=py_0
  - jpeg=9b=h024ee3a_2
  - jupyter_client=6.1.7=py_0
  - jupyter_core=4.6.3=py38_0
  - kiwisolver=1.3.0=py38h2531618_0
  - krb5=1.17.1=h173b8e3_0
  - lcms2=2.11=h396b838_0
  - ld_impl_linux-64=2.33.1=h53a641e_7
  - libedit=3.1.20191231=h14c3975_1
  - libffi=3.3=he6710b0_2
  - libgcc-ng=9.1.0=hdf63c60_0
  - libgfortran-ng=7.3.0=hdf63c60_0
  - libpng=1.6.37=hbc83047_0
  - libpq=12.2=h20c2e04_0
  - libprotobuf=3.13.0.1=hd408876_0
  - libsodium=1.0.18=h7b6447c_0
  - libstdcxx-ng=9.1.0=hdf63c60_0
  - libtiff=4.1.0=h2733197_1
  - lightgbm=3.1.1=py38h2531618_0
  - lz4-c=1.9.2=heb0550a_3
  - mako=1.1.3=py_0
  - markdown=3.3.2=py38_0
  - markupsafe=1.1.1=py38h7b6447c_0
  - matplotlib-base=3.2.2=py38hef1b27d_0
  - mkl=2019.4=243
  - mkl-service # version illegible in source (build py38he904b0f_0)
  - mkl_fft=1.2.0=py38h23d657b_0
  - mkl_random=1.1.0=py38h962f231_0
  - more-itertools=8.6.0=pyhd3eb1b0_0
  - multidict=4.7.6=py38h7b6447c_1
  - ncurses=6.2=he6710b0_1
  - networkx=2.5=py_0
  - nltk=3.5=py_0
  - numpy=1.19.2=py38h54aff64_0
  - numpy-base=1.19.2=py38hfa32c7d_0
  - oauthlib=3.1.0=py_0
  - olefile=0.46=py_0
  - openssl=1.1.1k=h27cfd23_0 # (updated from 1.1.1i in May 26, 2021 maintenance update)
  - packaging=20.4=py_0
  - pandas=1.1.3=py38he6710b0_0
  - paramiko=2.7.2=py_0
  - parso=0.7.0=py_0
  - patsy=0.5.1=py38_0
  - pexpect=4.8.0=pyhd3eb1b0_3
  - pickleshare=0.7.5=pyhd3eb1b0_1003
  - pillow=8.0.1=py38he98fc37_0
  - pip=20.2.4=py38h06a4308_0
  - plotly=4.14.2=pyhd3eb1b0_0
  - prompt-toolkit=3.0.8=py_0
  - prompt_toolkit=3.0.8=0
  - protobuf=3.13.0.1=py38he6710b0_1
  - psutil=5.7.2=py38h7b6447c_0
  - psycopg2=2.8.5=py38h3c74f83_1
  - ptyprocess=0.6.0=pyhd3eb1b0_2
  - pyasn1=0.4.8=py_0
  - pyasn1-modules=0.2.8=py_0
  - pycparser=2.20=py_2
  - pygments=2.7.2=pyhd3eb1b0_0
  - pyjwt=1.7.1=py38_0
  - pynacl=1.4.0=py38h7b6447c_1
  - pyodbc=4.0.30=py38he6710b0_0
  - pyopenssl=19.1.0=pyhd3eb1b0_1
  - pyparsing=2.4.7=pyhd3eb1b0_0
  - pysocks=1.7.1=py38h06a4308_0
  - python=3.8.8=hdb3f193_4 # (updated from 3.8.5 in May 26, 2021 maintenance update)
  - python-dateutil=2.8.1=py_0
  - python-editor=1.0.4=py_0
  - pytz=2020.1=py_0
  - pyzmq=19.0.2=py38he6710b0_1
  - readline=8.0=h7b6447c_0
  - regex=2020.10.15=py38h7b6447c_0
  - requests=2.24.0=py_0
  - requests-oauthlib=1.3.0=py_0
  - retrying=1.3.3=py_2
  - rsa=4.7=pyhd3eb1b0_1
  - s3transfer=0.3.4=pyhd3eb1b0_0
  - scikit-learn=0.23.2=py38h0573a6f_0
  - scipy=1.5.2=py38h0b6359f_0
  - setuptools=50.3.1=py38h06a4308_1
  - simplejson=3.17.2=py38h7b6447c_0
  - six=1.15.0=py38h06a4308_0
  - smmap=3.0.4=py_0
  - sqlite=3.33.0=h62c20be_0
  - sqlparse=0.4.1=py_0
  - statsmodels=0.12.0=py38h7b6447c_0
  - tabulate=0.8.7=py38_0
  - threadpoolctl=2.1.0=pyh5ca1d4c_0
  - tk=8.6.10=hbc83047_0
  - tornado=6.0.4=py38h7b6447c_1
  - tqdm=4.50.2=py_0
  - traitlets=5.0.5=py_0
  - typing_extensions=3.7.4.3=py_0
  - unixodbc=2.3.9=h7b6447c_0
  - urllib3=1.25.11=py_0
  - wcwidth=0.2.5=py_0
  - websocket-client=0.57.0=py38_2
  - werkzeug=1.0.1=py_0
  - wheel=0.35.1=pyhd3eb1b0_0
  - wrapt=1.12.1=py38h7b6447c_1
  - xz=5.2.5=h7b6447c_0
  - yarl=1.6.3=py38h27cfd23_0
  - zeromq=4.3.3=he6710b0_3
  - zipp=3.4.0=pyhd3eb1b0_0
  - zlib=1.2.11=h7b6447c_3
  - zstd=1.4.5=h9ceee32_0
  - pip:
    - astunparse==1.6.3
    - azure-core==1.10.0
    - azure-storage-blob==12.7.0
    - databricks-cli==0.14.1
    - diskcache==5.1.0
    - docker==4.4.1
    - flatbuffers==1.12
    - gast==0.3.3
    - grpcio==1.32.0
    - horovod==0.21.1
    - joblibspark==0.3.0
    - keras-preprocessing==1.1.2
    - koalas==1.5.0
    - llvmlite==0.35.0
    - mleap==0.16.1
    - mlflow==1.13.1
    - msrest==0.6.19
    - numba==0.52.0
    - opt-einsum==3.3.0
    - petastorm==0.9.8
    - pyarrow==1.0.1
    - pyyaml==5.4
    - querystring-parser # version illegible in source
    - seaborn==0.10.0
    - shap==0.37.0
    - slicer==0.0.3
    - spark-tensorflow-distributor==0.1.0
    - tensorboard==2.4.1
    - tensorboard-plugin-wit==1.8.0
    - tensorflow-cpu==2.4.0
    - tensorflow-estimator==2.4.0
    - termcolor==1.1.0
    - torch==1.7.1
    - torchvision==0.8.2
    - xgboost==1.3.1
prefix: /databricks/conda/envs/databricks-ml
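Entries under dependencies in the environment specification above follow conda's name=version=build form, while entries under pip use name==version. As a rough sketch of how such entries could be taken apart for inspection or comparison, the helpers below split each kind of string; CondaSpec, parse_conda_spec, and parse_pip_spec are hypothetical names introduced here, and the parsing covers only the simple forms that appear in this listing.

```python
from typing import NamedTuple, Optional, Tuple


class CondaSpec(NamedTuple):
    name: str
    version: Optional[str]
    build: Optional[str]


def parse_conda_spec(spec: str) -> CondaSpec:
    """Split 'name=version=build'; a trailing '# comment' is dropped."""
    spec = spec.split("#", 1)[0].strip()
    parts = spec.split("=")
    version = parts[1] if len(parts) > 1 else None
    build = parts[2] if len(parts) > 2 else None
    return CondaSpec(parts[0], version, build)


def parse_pip_spec(spec: str) -> Tuple[str, Optional[str]]:
    """Split 'name==version' into (name, version); version may be absent."""
    name, _, version = spec.partition("==")
    return name.strip(), (version.strip() or None)


print(parse_conda_spec("blas=1.0=mkl"))
# prints: CondaSpec(name='blas', version='1.0', build='mkl')
print(parse_pip_spec("xgboost==1.3.1"))
# prints: ('xgboost', '1.3.1')
```

Full conda match specifications allow more syntax (version ranges, channel prefixes) than this sketch handles; it is meant only for exact pins like the ones listed here.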

Spark packages containing Python modules

Spark package    Python module    Version
graphframes      graphframes      0.8.1-db2-spark3.1

R libraries

The R libraries are identical to the R libraries in Databricks Runtime 8.0.

Java and Scala libraries (Scala 2.12 cluster)

In addition to the Java and Scala libraries in Databricks Runtime 8.0, Databricks Runtime 8.0 ML contains the following JARs:

CPU clusters

Group ID                  Artifact ID                        Version
com.typesafe.akka         akka-actor_2.12                    2.5.23
ml.combust.mleap          mleap-databricks-runtime_2.12      0.17.0-4882dc3
ml.dmlc                   xgboost4j-spark_2.12               1.3.1
ml.dmlc                   xgboost4j_2.12                     1.3.1
org.graphframes           graphframes_2.12                   0.8.1-db2-spark3.1
org.mlflow                mlflow-client                      1.13.1
org.mlflow                mlflow-spark                       1.13.1
org.scala-lang.modules    scala-java8-compat_2.12            0.8.0
org.tensorflow            spark-tensorflow-connector_2.12    1.15.0