API Reference google_api_dataproc v0.48.0

Modules

API client metadata for GoogleApi.Dataproc.V1.

API calls for all endpoints tagged Projects.

Handle Tesla connections for GoogleApi.Dataproc.V1.

Specifies the type and number of accelerator cards attached to the instances of an instance. See GPUs on Compute Engine (https://cloud.google.com/compute/docs/gpus/).

Autoscaling Policy config associated with the cluster.

Describes an autoscaling policy for Dataproc cluster autoscaler.

Basic autoscaling configurations for YARN.

Metadata describing the Batch operation.

Associates members with a role.

Describes the identifying information, config, and status of a cluster of Compute Engine instances.

Contains cluster daemon metrics, such as HDFS and YARN stats.Beta Feature: This report is available for testing purposes only. It may be changed before final release.

The cluster operation triggered by a workflow.

Metadata describing the operation.

A selector that chooses target cluster for jobs based on metadata.

The status of a cluster and its instances.

Confidential Instance Config for clusters using Confidential VMs (https://cloud.google.com/compute/confidential-vm/docs) NEXT ID: 2

A request to collect cluster diagnostic information.

The location of diagnostic output.

Specifies the config of disk options for a group of VM instances.

A generic empty message that you can re-use to avoid defining duplicated empty messages in your APIs. A typical example is to use it as the request or the response type of an API method. For instance: service Foo { rpc Bar(google.protobuf.Empty) returns (google.protobuf.Empty); } The JSON representation for Empty is empty JSON object {}.

Encryption settings for the cluster.

Endpoint config for this cluster

Represents a textual expression in the Common Expression Language (CEL) syntax. CEL is a C-like expression language. The syntax and semantics of CEL are documented at https://github.com/google/cel-spec.Example (Comparison): title: "Summary size limit" description: "Determines if a summary is less than 100 chars" expression: "document.summary.size() < 100" Example (Equality): title: "Requestor is owner" description: "Determines if requestor is the document owner" expression: "document.owner == request.auth.claims.email" Example (Logic): title: "Public documents" description: "Determine whether the document should be publicly visible" expression: "document.type != 'private' && document.type != 'internal'" Example (Data Manipulation): title: "Notification string" description: "Create a notification string with a timestamp." expression: "'New message received at ' + string(document.create_time)" The exact variables and functions that may be referenced within an expression are determined by the service that evaluates it. See the service documentation for additional information.

Common config settings for resources of Compute Engine cluster instances, applicable to all instances in the cluster.

Request message for GetIamPolicy method.

Encapsulates settings provided to GetIamPolicy.

The GKE config for this cluster.

A Dataproc job for running Apache Hive (https://hive.apache.org/) queries on YARN.

Identity related configuration, including service account based secure multi-tenancy user mappings.

A request to inject credentials into a cluster.

Configuration for the size bounds of an instance group, including its proportional size to other groups.

The config settings for Compute Engine resources in an instance group, such as a master or worker group.

A reference to a Compute Engine instance.

A request to instantiate a workflow template.

A Dataproc job resource.

Job Operation metadata.

Encapsulates the full scoping used to reference a job.

Specifies Kerberos related configuration.

Specifies the cluster auto-delete schedule configuration.

A response to a request to list autoscaling policies in a project.

The list of all clusters in a project.

A list of jobs in a project.

The response message for Operations.ListOperations.

A response to a request to list workflow templates in a project.

The runtime logging config of the job.

Cluster that is managed by the workflow.

Specifies the resources used to actively manage an instance group.

Specifies a Metastore configuration.

A full, namespace-isolated deployment target for an existing GKE cluster.

Node Group Affinity for clusters using sole-tenant node groups.

Specifies an executable to run on a fully configured node and a timeout period for executable completion.

This resource represents a long-running operation that is the result of a network API call.

A job executed by the workflow.

Configuration for parameter validation.

A Dataproc job for running Apache Pig (https://pig.apache.org/) queries on YARN.

An Identity and Access Management (IAM) policy, which specifies access controls for Google Cloud resources.A Policy is a collection of bindings. A binding binds one or more members to a single role. Members can be user accounts, service accounts, Google groups, and domains (such as G Suite). A role is a named list of permissions; each role can be an IAM predefined role or a user-created custom role.For some types of Google Cloud resources, a binding can also specify a condition, which is a logical expression that allows access to a resource only if the expression evaluates to true. A condition can add constraints based on attributes of the request, the resource, or both. To learn which resources support conditions in their IAM policies, see the IAM documentation (https://cloud.google.com/iam/help/conditions/resource-policies).JSON example: { "bindings": [ { "role": "roles/resourcemanager.organizationAdmin", "members": [ "user:mike@example.com", "group:admins@example.com", "domain:google.com", "serviceAccount:my-project-id@appspot.gserviceaccount.com" ] }, { "role": "roles/resourcemanager.organizationViewer", "members": [ "user:eve@example.com" ], "condition": { "title": "expirable access", "description": "Does not grant access after Sep 2020", "expression": "request.time < timestamp('2020-10-01T00:00:00.000Z')", } } ], "etag": "BwWWja0YfJA=", "version": 3 } YAML example: bindings: - members: - user:mike@example.com - group:admins@example.com - domain:google.com - serviceAccount:my-project-id@appspot.gserviceaccount.com role: roles/resourcemanager.organizationAdmin - members: - user:eve@example.com role: roles/resourcemanager.organizationViewer condition: title: expirable access description: Does not grant access after Sep 2020 expression: request.time < timestamp('2020-10-01T00:00:00.000Z') - etag: BwWWja0YfJA= - version: 3 For a description of IAM and its features, see the IAM documentation (https://cloud.google.com/iam/docs/).

A Dataproc job for running Presto (https://prestosql.io/) queries. IMPORTANT: The Dataproc Presto Optional Component (https://cloud.google.com/dataproc/docs/concepts/components/presto) must be enabled when the cluster is created to submit a Presto job to the cluster.

A Dataproc job for running Apache PySpark (https://spark.apache.org/docs/0.9.0/python-programming-guide.html) applications on YARN.

A list of queries to run on a cluster.

Validation based on regular expressions.

Reservation Affinity for consuming Zonal reservation.

Security related configuration, including encryption, Kerberos, etc.

Request message for SetIamPolicy method.

Shielded Instance Config for clusters using Compute Engine Shielded VMs (https://cloud.google.com/security/shielded-cloud/shielded-vm).

Specifies the selection and config of software inside the cluster.

A Dataproc job for running Apache Spark (http://spark.apache.org/) applications on YARN.

A Dataproc job for running Apache SparkR (https://spark.apache.org/docs/latest/sparkr.html) applications on YARN.

A Dataproc job for running Apache Spark SQL (http://spark.apache.org/sql/) queries.

A request to start a cluster.

The Status type defines a logical error model that is suitable for different programming environments, including REST APIs and RPC APIs. It is used by gRPC (https://github.com/grpc). Each Status message contains three pieces of data: error code, error message, and error details.You can find out more about this error model and how to work with it in the API Design Guide (https://cloud.google.com/apis/design/errors).

A request to stop a cluster.

A configurable parameter that replaces one or more fields in the template. Parameterizable fields: - Labels - File uris - Job properties - Job arguments - Script variables - Main class (in HadoopJob and SparkJob) - Zone (in ClusterSelector)

Request message for TestIamPermissions method.

Response message for TestIamPermissions method.

Validation based on a list of allowed values.

A Dataproc workflow template resource.

A Dataproc workflow template resource.

Specifies workflow execution target.Either managed_cluster or cluster_selector is required.

A YARN application created by a job. Application information is a subset of org.apache.hadoop.yarn.proto.YarnProtos.ApplicationReportProto.Beta Feature: This report is available for testing purposes only. It may be changed before final release.