This topic describes how to go to the details page of a table and view the details about the table, such as the basic information, output information, and lineage information.

Go to the details page of a table

  1. Log on to the DataWorks console.
  2. In the left-side navigation pane, click Workspaces.
  3. Find the desired workspace and click Data Analytics in the Actions column.
  4. On the DataStudio page, click the Icon icon in the upper-left corner and choose All Products > Data governance > DataMap.
  5. In the top navigation bar, click All Data.
  6. In the upper part of the All Data tab, select a data source type from the drop-down list. In this topic, MaxCompute is selected.
  7. Find the card of the table that you want to view and click the table name to go to the table details page.

View the details of a table

On the table details page, the following information is displayed: Table Basic Information, Business Information, Permission Information, Technical Information, Details, Instances, Lineage, Records, Data Preview, Data Profiling, and Usage Notes. Table details
Section Description
1 In this section, you can perform the following operations:
  • Apply for permissions on tables in Security Center and view the application records in DataMap. For more information, see Request permissions on tables.
  • Add tables to favorites or remove tables from favorites.
  • Generate APIs or register APIs in DataService Studio. For more information, see Overview.
2 In this section, you can search for a table by keyword such as table name, field name, or project name.
3 In this section, you can view the following information of the table:
  • Table Basic Information: You can view the information such as Data Reads, Add to Favorites, and Views. You can also click View Code next to Output Node to view the code of the node that generates that table.
  • Business Information: You can view the information such as DataWorks Workspace, Environment, and Category.
  • Permission Information: You can view the permissions that you are granted on the table. You can also click View More in the upper-right corner of this section to go to the Permission application tab of the Data access control page and apply for more permissions on the table.
  • Technical Information: You can view the information such as Type. You can also click View next to Compute Engine Information to view or copy the compute engine information.
4 In this section, you can view the following information of the table:
  • Details: You can view the following information of the table on this tab: Field Information, Partitions, and Change Records. For more information, see View information on the Details tab.
  • Instances: If the table data periodically changes with the node that generates the table, you can view the change status and the data that is continuously updated on this tab.
  • Lineage: You can view the inner lineages of the node that generates the table. If the current table is used as the data source of an API, you can also view the lineages between the table and the API. MaxCompute allows you to view the complete lineages of a batch synchronization node that is used to synchronize data to MaxCompute. For more information, see View lineage information on the Lineage tab.
    Note For more information about how to view the complete lineage of a DataService Studio API, see View the details of an API.
  • Records: You can view the reference and access records of the table on the following subtabs:
    • Foreign Key References: On this subtab, you can view the number of users who reference the current table.
    • Access Statistics: On this subtab, you can view the reference records in a line chart.
  • Data Preview: You can preview data in the current table on this tab.
    Notice
    • You can preview the tables that are in the production environment only if you have the required permissions. For more information about how to apply for permissions on tables, see Request permissions on tables.
    • If the preview feature for tables in the development environment and that for tables in the production environment are enabled for the workspace to which the table belongs on the Configuration Management page, you can preview the table data on the Data Preview tab without the need to apply for permissions on tables in Security Center.
    • If you configure data masking rules and the data masking rules are in effect, the Data Preview tab displays data based on the data masking rules. For more information about how to configure data masking rules, see Customize de-identification rules.
    • The Data Preview tab cannot display data in external tables.
  • Data Profiling: DataWorks detects the data of a table based on the schema and a partition key value. The data profiling results include basic statistical information and data distribution. For more information, see Perform data profiling on the Data Profiling tab.
  • Usage Notes: You can click Edit, View Versions, or View Markdown Syntax to view the related information.

View information on the Details tab

The Details tab contains the following subtabs: Field Information, Partitions, and Change Records.
  • Field Information: You can view the field information of the table on this subtab. If the table is a partitioned table, you can also view Partition Key Fields in the table. Field Information
    Operation Description
    Edit You can click this button to modify Description, Business Description, Security Level, and Primary Key. You can also save the modified information or cancel the modification.
    You can specify a security level for multiple fields at a time.
    Note The security level feature is exclusive to MaxCompute. The Security Level column is displayed on the Field Information tab only for MaxCompute tables for which you specify field security levels.
    Upload You can click this button and drag the file that you want to upload from your on-premises machine to the Batch Upload Field Information dialog box.

    Only .xlsx files created in Excel 2007 are supported. You can also click Download Template File to download the template file.

    Download You can click this button to download the field information of the table.
    Generate SELECT Statement You can click this button to view or copy the SELECT statement that is used to query the table data in the Generate SELECT Statement dialog box.
    Generate DDL Statement You can click this button to view or copy the data definition language (DDL) statement that is used to create the table in the Generate DDL Statement dialog box.
  • Partitions: You can view the information such as Partition Name and Records on this subtab. Partitions
  • Change Records: You can view the information such as Description, Change Type, and Object. Change Records

    You can select a change type from the drop-down list on the Change Records subtab to view the related table changes.

    Change types include Create Table, Modify Table, Delete Table, Create Partition, Delete Partition, Change Owner, and Change TTL.

View lineage information on the Lineage tab

The Lineage tab displays the inner lineages of the node that generates the table. MaxCompute also allows you to view the complete lineages of a batch synchronization node that is used to synchronize data to MaxCompute. You can view the ancestor and descendant tables of a MaxCompute table. You can also expand the lineage levels of the MaxCompute table to view the sources and destinations of the table.
Note The lineage feature is supported only in DataWorks Standard Edition or a more advanced edition. If the compute engine is MaxCompute, this feature is available only in DataWorks Standard Edition or a more advanced edition. If the compute engine is E-MapReduce (EMR), this feature is available only in DataWorks Professional Edition or a more advanced edition.
On the Lineage tab, you can view Table Lineage, Field Lineage, and Impact Analysis.
  • The Table Lineage tab consists of the Graph Analysis and View by Level subtabs.
    • On the Graph Analysis subtab, you can view the number of ancestor and descendant tables at all levels of the table. You can also view the total number of ancestor and descendant tables for each table. Lineage
    • By default, you can view the ancestor and descendant tables at the nearest level of the table on the View by Level subtab. You can search for ancestor and descendant tables based on their globally unique identifiers (GUIDs). View by Level
  • On the Field Lineage subtab, you can select a field in the table from the Field Name drop-down list to view the lineage information of the field. Field Lineage
  • On the Impact Analysis subtab, you can specify one of the following conditions to view Scheduling Output, or Full Link of the lineage: Lineage Level, Field Lineage Node Type, Table Name, Project Name, and Table Owner. Impact Analysis

    You can click Start Analysis to perform impact analysis. After the analysis is complete, you can download the impact analysis result. You can also enable the system to send the impact analysis result to the owners of descendant tables of the current table by email.

Perform data profiling on the Data Profiling tab

Notice The data profiling feature is in public preview and is supported only by MaxCompute compute engine instances that reside in the China (Shanghai) region.

DataWorks detects the data of a table based on the schema and a partition key value. The data profiling results include basic statistical information and data distribution.

Limits:
  • You can detect only data in partitioned tables.
  • You can detect only tables that are in the production environment.
  • Only the table owner can enable the Auto Profiling feature.
On the Data Profiling tab, you can configure a profiling mode and view data profiling records. Data Profiling
Data profiling supports Manual Profiling and Auto Profiling.
  • Manual Profiling
    Note Data profiling tasks run in the MaxCompute project to which a detected table belongs. The system can detect a maximum of 10 columns in a table at a time. To save resources, select only the columns that you want to detect.
    To enable manual profiling, perform the following steps:
    1. On the Data Profiling tab, click Manual Profiling.
    2. In the Manual Profiling dialog box, configure the following parameters. Manual Profiling
      Parameter Description
      Table Name The name of the table. The format is Workspace name.Table name. The value of this parameter is generated by the system and cannot be changed.
      Partition Value The partition that you want to detect. Select a partition from the Partition Value drop-down list.
      Detailed Configuration The selected columns.
      Estimated Cost The estimated cost for running the data profiling task. The cost is estimated based on the settings of the preceding parameters.
      Notice To detect data in the MaxCompute table, you must execute MaxCompute SQL statements. In this case, you will be charged for using the MaxCompute service. The estimated cost is for reference only. The actual cost may vary based on the amount of data that is detected. You can check bills for MaxCompute for the actual cost.
    3. Select I understand that using this service will be charged.
    4. Click Commit.
    5. After the data profiling task is completed, view the data profiling results on the Data Profiling tab.

      You can select an option from the Profiling Records drop-down list to view the desired data profiling result. You can choose Data Distribution > Value range to view the distribution of data values in a field.

  • Auto Profiling
    To enable auto profiling, perform the following steps:
    1. Turn on Auto Profiling.
    2. In the Auto Profiling (When Partition Information Changes) dialog box, configure the following parameters. Auto Profiling
      Parameter Description
      Table Name The name of the table. The format is Workspace name.Table name. The value of this parameter is generated by the system and cannot be changed.
      Partition Value By default, the partition value is the latest partition value when data profiling is triggered. You cannot change the value.
      Detailed Configuration The selected columns.
      Bind Trigger The auto triggered node that triggers auto profiling. You must select an auto triggered node from the Bind Trigger drop-down list. You can view the IDs of auto triggered nodes in Operation Center. We recommend that you select the node that generates the current table.

      After you select the metrics based on which you want to detect the table data and submit the auto profiling task, the system runs the auto profiling task to detect the latest partition in the table after the auto triggered node is successfully run.

      Estimated Cost The estimated cost for running the data profiling task. The cost is estimated based on the settings of the preceding parameters.
      Notice To detect data in the MaxCompute table, you must execute MaxCompute SQL statements. In this case, you will be charged for using the MaxCompute service. The estimated cost is for reference only. The actual cost may vary based on the amount of data that is detected. You can check bills for MaxCompute for the actual cost.
    3. Select I understand that using this service will be charged.
    4. Click Commit.
    5. After the data profiling task is completed, view the data profiling results on the Data Profiling tab.

      You can select an option from the Profiling Records drop-down list to view the desired data profiling result.