Skip to content

Commit 46698d6

Browse files
Sud0x67SbloodyS
andauthored
[Chore][Doc] Fix typos and improve the expression in glossary.md. (#17380)
Co-authored-by: xiangzihao <460888207@qq.com>
1 parent 291adfb commit 46698d6

1 file changed

Lines changed: 28 additions & 26 deletions

File tree

docs/docs/en/about/glossary.md

Lines changed: 28 additions & 26 deletions
Original file line numberDiff line numberDiff line change
@@ -5,47 +5,49 @@ scheduling system
55

66
### 1.Glossary
77

8-
**DAG:** The full name is Directed Acyclic Graph, referred to as DAG. Task tasks in the workflow are assembled in the
9-
form of a directed acyclic graph, and topological traversal is performed from nodes with zero degrees of entry until
10-
there are no subsequent nodes. Examples are as follows:
8+
**DAG:** The full name is Directed Acyclic Graph, referred to as DAG. Tasks in the workflow are assembled in the
9+
form of a directed acyclic graph, and topological traversal is performed from nodes with zero in-degree until
10+
there are no subsequent nodes. An example is shown below:
1111

1212
![about-glossary](../../../img/new_ui/dev/about/glossary.png)
1313

14-
**Process definition**: Visualization formed by dragging task nodes and establishing task node associations**DAG**
14+
**Process definition**: Visualization of a **DAG** formed by dragging task nodes and establishing associations between them.
1515

16-
**Process instance**: The process instance is the instantiation of the process definition, which can be generated by
17-
manual start or scheduled scheduling. Each time the process definition runs, a process instance is generated
16+
**Process instance**: A process instance is the instantiation of a process definition, which can be generated by
17+
manual start or scheduled triggering. Each time a process definition runs, a process instance is generated
1818

19-
**Task instance**: The task instance is the instantiation of the task node in the process definition, which identifies
20-
the specific task
19+
**Task instance**: An instantiation of a task node within a process definition, representing a specific execution
20+
of that task.
2121

2222
**Task type**: Currently supports SHELL, SQL, SUB_WORKFLOW, PROCEDURE, MR, SPARK, PYTHON, DEPENDENT (
23-
depends), and plans to support dynamic plug-in expansion, note: **SUB_WORKFLOW** need relation with another workflow definition which also a separate process
24-
definition that can be started and executed separately
23+
depends), and plans to support dynamic plug-in expansion, note: **SUB_WORKFLOW** need be associated with another
24+
workflow definition which is a separate process definition that can be started and executed separately
2525

26-
**Scheduling method**: The system supports scheduled scheduling and manual scheduling based on cron expressions. Command
27-
type support: start workflow, start execution from current node, resume fault-tolerant workflow, resume pause process,
28-
start execution from failed node, complement, timing, rerun, pause, stop, resume waiting thread. Among them **Resume
29-
fault-tolerant workflow** and **Resume waiting thread** The two command types are used by the internal control of
30-
scheduling, and cannot be called from the outside
26+
**Scheduling method**: The system supports scheduled triggering (based on cron expressions) and manual triggering.
27+
Command types support: start workflow, start execution from current node, resume fault-tolerant workflow,
28+
resume pause process, start execution from failed node, complement, timing, rerun, pause, stop, resume waiting thread.
29+
Among them the command types **Resume fault-tolerant workflow** and **Resume waiting thread** are used by the internal
30+
scheduling control and cannot be invoked externally.
3131

32-
**Scheduled**: System adopts **quartz** distributed scheduler, and supports the visual generation of cron expressions
32+
**Scheduled**: The system adopts **quartz** distributed scheduler, and supports visual generation of cron expressions
3333

34-
**Rely**: The system not only supports **DAG** simple dependencies between the predecessor and successor nodes, but also
35-
provides **task dependent** nodes, supporting **between processes**
34+
**Dependencies**: The system not only supports simple **DAG** dependencies between predecessor and successor nodes, but also
35+
provides **task dependent** nodes, supporting dependencies **between processes**
3636

37-
**Priority**: Support the priority of process instances and task instances, if the priority of process instances and
38-
task instances is not set, the default is first-in-first-out
37+
**Priority**: Supports priority settings for both process instances and task instances. If no priority is specified,
38+
the system defaults to a first-in, first-out (FIFO) execution order.
3939

4040
**Email alert**: Support **SQL task** Query result email sending, process instance running result email alert and fault
4141
tolerance alert notification
4242

43-
**Failure strategy**: For tasks running in parallel, if a task fails, two failure strategy processing methods are
44-
provided. **Continue** refers to regardless of the status of the task running in parallel until the end of the process
45-
failure. **End** means that once a failed task is found, Kill will also run the parallel task at the same time, and the
46-
process fails and ends
43+
**Failure strategy**: For workflows with parallel task execution, the system provides two failure handling strategies.
44+
**Continue** If a task fails, the system continues executing other parallel tasks to completion, regardless of the failure.
45+
The overall process is marked as failed only after all parallel tasks have finished running.
46+
**End** means that upon a task failure, the system immediately marks the process as failed and terminates any currently
47+
running parallel tasks
4748

48-
**Complement**: Supplement historical data,supports **interval parallel** and **serial** two complement methods, and two types of date selection which include **date range** and **date enumeration**.
49+
**Complement**: backfilling historical data,supports **interval parallel** and **serial** two complement modes,
50+
and two date selection methods including **date range** and **date enumeration**.
4951

5052
### 2.Module introduction
5153

@@ -71,5 +73,5 @@ process fails and ends
7173
### Sum up
7274

7375
From the perspective of scheduling, this article preliminarily introduces the architecture principles and implementation
74-
ideas of the big data distributed workflow scheduling system-DolphinScheduler. To be continued
76+
ideas of the big data distributed workflow scheduling systemDolphinScheduler. To be continued
7577

0 commit comments

Comments
 (0)