Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[bug](schema change)fix schema change cause load failed due to err -215 #23836

Open
wants to merge 3 commits into
base: master
Choose a base branch
from

Conversation

DarvenDuan
Copy link
Contributor

@DarvenDuan DarvenDuan commented Sep 4, 2023

Proposed changes

If Doris schema change job and load job execute in parallel, load job may be failed after schema change job finished.

Schema change job will generate a new shadow index for loading new data stream and convert history data. After schema change job finished, FE will delete the origin index and its' tablet in fe's meta, and then sends drop tablet task to BE to drop origin tablet meta and data in BE. But if a load job has not finished, which is loading data to both origin tablet and new tablet, it will fail due to OLAP_ERR_TABLE_NOT_FOUND.

This PR implements that Doris will not delete tablets of origin index immediately when schema change job is finished, but set the tablets' state to DECOMMISSION, Doris will delete those tablets later after all transactions on those tablets are finished.

Further comments

If this is a relatively large or complex change, kick off the discussion at dev@doris.apache.org by explaining why you chose the solution you did and what alternatives you considered, etc...

@DarvenDuan DarvenDuan force-pushed the fix_schema_change_bug branch 5 times, most recently from d0b2d08 to 6eb39ab Compare September 4, 2023 09:42
@hello-stephen
Copy link
Contributor

(From new machine)TeamCity pipeline, clickbench performance test result:
the sum of best hot time: 46.95 seconds
stream load tsv: 530 seconds loaded 74807831229 Bytes, about 134 MB/s
stream load json: 20 seconds loaded 2358488459 Bytes, about 112 MB/s
stream load orc: 65 seconds loaded 1101869774 Bytes, about 16 MB/s
stream load parquet: 30 seconds loaded 861443392 Bytes, about 27 MB/s
insert into select: 28.9 seconds inserted 10000000 Rows, about 346K ops/s
storage size: 17161983646 Bytes

@hello-stephen
Copy link
Contributor

(From new machine)TeamCity pipeline, clickbench performance test result:
the sum of best hot time: 50.03 seconds
stream load tsv: 536 seconds loaded 74807831229 Bytes, about 133 MB/s
stream load json: 20 seconds loaded 2358488459 Bytes, about 112 MB/s
stream load orc: 64 seconds loaded 1101869774 Bytes, about 16 MB/s
stream load parquet: 31 seconds loaded 861443392 Bytes, about 26 MB/s
insert into select: 29.4 seconds inserted 10000000 Rows, about 340K ops/s
storage size: 17162240138 Bytes

@doris-robot
Copy link

(From new machine)TeamCity pipeline, clickbench performance test result:
the sum of best hot time: 49.72 seconds
stream load tsv: 535 seconds loaded 74807831229 Bytes, about 133 MB/s
stream load json: 21 seconds loaded 2358488459 Bytes, about 107 MB/s
stream load orc: 64 seconds loaded 1101869774 Bytes, about 16 MB/s
stream load parquet: 31 seconds loaded 861443392 Bytes, about 26 MB/s
insert into select: 29.7 seconds inserted 10000000 Rows, about 336K ops/s
storage size: 17161982869 Bytes

@DarvenDuan
Copy link
Contributor Author

run buildall

@doris-robot
Copy link

(From new machine)TeamCity pipeline, clickbench performance test result:
the sum of best hot time: 45.69 seconds
stream load tsv: 560 seconds loaded 74807831229 Bytes, about 127 MB/s
stream load json: 20 seconds loaded 2358488459 Bytes, about 112 MB/s
stream load orc: 64 seconds loaded 1101869774 Bytes, about 16 MB/s
stream load parquet: 33 seconds loaded 861443392 Bytes, about 24 MB/s
insert into select: 28.8 seconds inserted 10000000 Rows, about 347K ops/s
storage size: 17162002702 Bytes

@DarvenDuan DarvenDuan force-pushed the fix_schema_change_bug branch from 7a45778 to 871005b Compare October 7, 2023 06:50
@doris-robot
Copy link

(From new machine)TeamCity pipeline, clickbench performance test result:
the sum of best hot time: 45.26 seconds
stream load tsv: 561 seconds loaded 74807831229 Bytes, about 127 MB/s
stream load json: 21 seconds loaded 2358488459 Bytes, about 107 MB/s
stream load orc: 64 seconds loaded 1101869774 Bytes, about 16 MB/s
stream load parquet: 33 seconds loaded 861443392 Bytes, about 24 MB/s
insert into select: 28.9 seconds inserted 10000000 Rows, about 346K ops/s
storage size: 17162644398 Bytes

@doris-robot
Copy link

(From new machine)TeamCity pipeline, clickbench performance test result:
the sum of best hot time: 47.35 seconds
stream load tsv: 555 seconds loaded 74807831229 Bytes, about 128 MB/s
stream load json: 20 seconds loaded 2358488459 Bytes, about 112 MB/s
stream load orc: 65 seconds loaded 1101869774 Bytes, about 16 MB/s
stream load parquet: 32 seconds loaded 861443392 Bytes, about 25 MB/s
insert into select: 28.9 seconds inserted 10000000 Rows, about 346K ops/s
storage size: 17162015386 Bytes

@doris-robot
Copy link

(From new machine)TeamCity pipeline, clickbench performance test result:
the sum of best hot time: 46.23 seconds
stream load tsv: 553 seconds loaded 74807831229 Bytes, about 129 MB/s
stream load json: 20 seconds loaded 2358488459 Bytes, about 112 MB/s
stream load orc: 65 seconds loaded 1101869774 Bytes, about 16 MB/s
stream load parquet: 32 seconds loaded 861443392 Bytes, about 25 MB/s
insert into select: 29.1 seconds inserted 10000000 Rows, about 343K ops/s
storage size: 17161917001 Bytes

@DarvenDuan DarvenDuan force-pushed the fix_schema_change_bug branch from aab7203 to 273e1fe Compare October 24, 2023 12:30
@doris-robot
Copy link

(From new machine)TeamCity pipeline, clickbench performance test result:
the sum of best hot time: 46.22 seconds
stream load tsv: 553 seconds loaded 74807831229 Bytes, about 129 MB/s
stream load json: 20 seconds loaded 2358488459 Bytes, about 112 MB/s
stream load orc: 65 seconds loaded 1101869774 Bytes, about 16 MB/s
stream load parquet: 32 seconds loaded 861443392 Bytes, about 25 MB/s
insert into select: 29.1 seconds inserted 10000000 Rows, about 343K ops/s
storage size: 17162123071 Bytes

@DarvenDuan DarvenDuan force-pushed the fix_schema_change_bug branch 2 times, most recently from 2643cba to 8ebafe7 Compare October 25, 2023 02:57
@doris-robot
Copy link

(From new machine)TeamCity pipeline, clickbench performance test result:
the sum of best hot time: 46.15 seconds
stream load tsv: 551 seconds loaded 74807831229 Bytes, about 129 MB/s
stream load json: 20 seconds loaded 2358488459 Bytes, about 112 MB/s
stream load orc: 64 seconds loaded 1101869774 Bytes, about 16 MB/s
stream load parquet: 32 seconds loaded 861443392 Bytes, about 25 MB/s
insert into select: 28.8 seconds inserted 10000000 Rows, about 347K ops/s
storage size: 17162195397 Bytes

@DarvenDuan DarvenDuan force-pushed the fix_schema_change_bug branch 7 times, most recently from e9f7f1f to 46e06fe Compare October 25, 2023 12:59
@DarvenDuan
Copy link
Contributor Author

Maybe we should take cloud schema change and cloud rollup into account in current master branch.

I will try to implement this for cloud model in the future.

@DarvenDuan
Copy link
Contributor Author

run p0

@TangSiyang2001
Copy link
Collaborator

run buildall

Comment on lines 648 to 653
try {
this.deleteTabletWatermarkTxnId =
Env.getCurrentGlobalTransactionMgr().getNextTransactionId();
} catch (UserException e) {
LOG.warn("get next transaction id failed");
}
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This assign method could be better put into AlterJob.java, just add an unified method like:

protected void assignDeleteTabletWatermarkTxnId() {
    try {
            this.deleteTabletWatermarkTxnId =
                    Env.getCurrentGlobalTransactionMgr().getNextTransactionId();
        } catch (UserException e) {
            LOG.warn("get next transaction id failed");
        }
}

and we cloud just call assignDeleteTabletWatermarkTxnId() here and also in SchemaChangeJobV2, even in CloudSchemaChangeJob and CloudRollupJob in the future.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

@doris-robot
Copy link

TPC-H: Total hot run time: 40319 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 9c3860305621eedc80f82a622189a67e549bf8c2, data reload: false

------ Round 1 ----------------------------------
q1	17637	4323	4241	4241
q2	2022	183	176	176
q3	11880	1062	1065	1062
q4	10640	811	796	796
q5	7755	2778	2776	2776
q6	223	137	137	137
q7	966	593	601	593
q8	9551	2026	2074	2026
q9	10295	6621	6596	6596
q10	7014	2212	2234	2212
q11	459	239	242	239
q12	393	222	217	217
q13	19646	3023	2990	2990
q14	286	251	235	235
q15	538	500	503	500
q16	509	390	382	382
q17	971	675	764	675
q18	8030	7522	7410	7410
q19	7368	1103	1023	1023
q20	721	358	341	341
q21	5534	4725	4629	4629
q22	1171	1063	1066	1063
Total cold run time: 123609 ms
Total hot run time: 40319 ms

----- Round 2, with runtime_filter_mode=off -----
q1	5336	4517	4492	4492
q2	384	267	279	267
q3	3178	2814	2783	2783
q4	1988	1789	1796	1789
q5	5623	5752	5640	5640
q6	228	140	138	138
q7	2186	1788	1758	1758
q8	3260	3383	3450	3383
q9	8824	8677	8656	8656
q10	3487	3293	3255	3255
q11	586	494	507	494
q12	824	638	612	612
q13	17205	3221	3153	3153
q14	306	294	291	291
q15	541	500	502	500
q16	497	426	422	422
q17	1822	1537	1505	1505
q18	7982	8160	7827	7827
q19	1729	1634	1498	1498
q20	2178	1920	1894	1894
q21	5331	5241	5315	5241
q22	1185	1078	1026	1026
Total cold run time: 74680 ms
Total hot run time: 56624 ms

@TangSiyang2001
Copy link
Collaborator

TangSiyang2001 commented Aug 13, 2024

It seems load job come after the water mark but before tablet decomission will still run into this problem. However, this PR may prevernt -215 in most cases. I have no better ideas currently @DarvenDuan @dataroaring .

@doris-robot
Copy link

TPC-DS: Total hot run time: 189677 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 9c3860305621eedc80f82a622189a67e549bf8c2, data reload: false

query1	1253	877	858	858
query2	6293	1840	1848	1840
query3	10610	3812	3758	3758
query4	58353	23920	23313	23313
query5	6084	504	518	504
query6	509	202	151	151
query7	6498	296	291	291
query8	300	200	198	198
query9	9143	2429	2419	2419
query10	485	275	266	266
query11	18757	14960	15049	14960
query12	161	104	104	104
query13	1585	397	378	378
query14	11862	7868	6914	6914
query15	237	178	181	178
query16	7288	548	479	479
query17	1143	591	586	586
query18	1981	311	306	306
query19	270	161	154	154
query20	120	110	112	110
query21	216	112	105	105
query22	4572	4423	4173	4173
query23	34100	33551	33206	33206
query24	5581	2888	2906	2888
query25	536	411	412	411
query26	704	159	160	159
query27	1804	281	279	279
query28	3880	2053	2046	2046
query29	668	428	427	427
query30	242	150	148	148
query31	954	729	729	729
query32	89	55	58	55
query33	476	298	285	285
query34	867	472	475	472
query35	804	727	729	727
query36	1031	920	932	920
query37	135	81	88	81
query38	3979	3937	3844	3844
query39	1444	1401	1402	1401
query40	208	127	120	120
query41	49	43	43	43
query42	119	104	104	104
query43	503	464	450	450
query44	1100	746	742	742
query45	194	168	166	166
query46	1094	729	721	721
query47	1828	1776	1778	1776
query48	371	303	299	299
query49	776	428	439	428
query50	818	416	414	414
query51	6854	6612	6669	6612
query52	98	104	92	92
query53	269	191	186	186
query54	590	463	450	450
query55	77	74	80	74
query56	284	358	246	246
query57	1148	1055	1074	1055
query58	230	248	226	226
query59	2978	2604	2706	2604
query60	281	279	255	255
query61	100	96	115	96
query62	732	649	644	644
query63	220	185	183	183
query64	3300	1714	1709	1709
query65	3182	3129	3184	3129
query66	719	343	351	343
query67	15400	14896	14910	14896
query68	6369	556	563	556
query69	639	374	301	301
query70	1214	1089	1099	1089
query71	484	299	299	299
query72	7357	2269	2025	2025
query73	795	327	336	327
query74	9077	8799	8849	8799
query75	4479	2719	2654	2654
query76	3458	987	959	959
query77	761	313	301	301
query78	9745	8967	9156	8967
query79	9004	550	548	548
query80	1212	509	514	509
query81	557	227	224	224
query82	1020	139	137	137
query83	403	149	151	149
query84	276	73	77	73
query85	1247	274	275	274
query86	362	285	306	285
query87	4372	4159	4147	4147
query88	4908	2424	2407	2407
query89	559	294	282	282
query90	1966	199	196	196
query91	121	98	98	98
query92	64	51	51	51
query93	6620	548	541	541
query94	691	268	298	268
query95	351	271	263	263
query96	626	277	275	275
query97	3198	3011	3045	3011
query98	225	207	201	201
query99	1712	1282	1275	1275
Total cold run time: 332408 ms
Total hot run time: 189677 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 30.79 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit 9c3860305621eedc80f82a622189a67e549bf8c2, data reload: false

query1	0.04	0.04	0.04
query2	0.07	0.04	0.04
query3	0.22	0.05	0.05
query4	1.68	0.07	0.08
query5	0.48	0.47	0.48
query6	1.13	0.73	0.72
query7	0.02	0.01	0.01
query8	0.05	0.04	0.04
query9	0.54	0.48	0.48
query10	0.54	0.55	0.54
query11	0.15	0.12	0.12
query12	0.15	0.12	0.12
query13	0.60	0.60	0.59
query14	0.77	0.79	0.77
query15	0.87	0.81	0.80
query16	0.37	0.36	0.37
query17	0.95	1.03	1.04
query18	0.23	0.21	0.20
query19	1.86	1.73	1.73
query20	0.01	0.00	0.00
query21	15.40	0.74	0.66
query22	3.94	7.02	2.05
query23	18.27	1.43	1.27
query24	2.16	0.22	0.21
query25	0.15	0.08	0.06
query26	0.30	0.21	0.21
query27	0.46	0.22	0.22
query28	13.27	1.02	1.01
query29	12.61	3.35	3.29
query30	0.23	0.05	0.05
query31	2.89	0.39	0.38
query32	3.26	0.49	0.48
query33	2.96	2.96	2.95
query34	17.11	4.35	4.39
query35	4.40	4.41	4.43
query36	0.66	0.46	0.47
query37	0.18	0.16	0.15
query38	0.15	0.15	0.14
query39	0.05	0.04	0.03
query40	0.15	0.12	0.11
query41	0.09	0.04	0.05
query42	0.06	0.05	0.04
query43	0.05	0.04	0.04
Total cold run time: 109.53 s
Total hot run time: 30.79 s

@DarvenDuan
Copy link
Contributor Author

DarvenDuan commented Aug 13, 2024

It seems load job come after the water mark but before tablet decomission will still run into this problem. However, this PR may prevernt -215 in most cases. I have no better ideas currently @DarvenDuan @dataroaring .

Doris holds the table's writeLock,then sets tablets to decommission and deletes index infos(olapTable.deleteIndexInfo(rollupIndexName)),load jobs will wait table's writeLock and get table's index infos for loading,so this should not happen In theory.

@TangSiyang2001
Copy link
Collaborator

It seems load job come after the water mark but before tablet decomission will still run into this problem. However, this PR may prevernt -215 in most cases. I have no better ideas currently @DarvenDuan @dataroaring .

Doris holds the table's writeLock,then sets tablets to decommission and deletes index infos(olapTable.deleteIndexInfo(rollupIndexName)),load jobs will wait table's writeLock and get table's index infos for loading,so this should not happen In theory.

Get it, not problems for me.

Copy link
Collaborator

@TangSiyang2001 TangSiyang2001 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Copy link
Contributor

PR approved by anyone and no changes requested.

@doris-robot
Copy link

TPC-H: Total hot run time: 40063 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 4efbbf466def997cef1a12e655d959afd0049f3f, data reload: false

------ Round 1 ----------------------------------
q1	18524	8855	4391	4391
q2	2023	178	189	178
q3	11665	977	1115	977
q4	10464	749	726	726
q5	7744	2813	2802	2802
q6	224	137	140	137
q7	952	591	594	591
q8	9303	2079	2055	2055
q9	8537	6525	6529	6525
q10	7037	2224	2123	2123
q11	459	244	252	244
q12	395	225	223	223
q13	17768	2977	2981	2977
q14	282	231	224	224
q15	535	479	470	470
q16	492	395	380	380
q17	967	699	722	699
q18	8148	7478	7448	7448
q19	4721	1022	994	994
q20	656	331	331	331
q21	5429	4572	4604	4572
q22	1088	1008	996	996
Total cold run time: 117413 ms
Total hot run time: 40063 ms

----- Round 2, with runtime_filter_mode=off -----
q1	4509	4314	4310	4310
q2	377	275	285	275
q3	3011	2791	2673	2673
q4	2063	1757	1747	1747
q5	5639	5568	5640	5568
q6	232	142	140	140
q7	2164	1779	1762	1762
q8	3285	3437	3412	3412
q9	8845	8911	8764	8764
q10	3416	3281	3290	3281
q11	615	525	510	510
q12	829	609	673	609
q13	16919	3124	3179	3124
q14	326	291	301	291
q15	535	477	503	477
q16	519	454	444	444
q17	1817	1565	1508	1508
q18	8227	8032	8015	8015
q19	1826	1686	1558	1558
q20	2975	1864	1894	1864
q21	5520	5146	5382	5146
q22	1273	1045	1007	1007
Total cold run time: 74922 ms
Total hot run time: 56485 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 189973 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 4efbbf466def997cef1a12e655d959afd0049f3f, data reload: false

query1	1257	885	872	872
query2	6337	1901	1886	1886
query3	10608	3740	3730	3730
query4	59128	27091	23162	23162
query5	6061	509	501	501
query6	520	166	166	166
query7	6519	293	287	287
query8	306	207	201	201
query9	9017	2419	2380	2380
query10	512	284	245	245
query11	18599	14964	15340	14964
query12	146	101	101	101
query13	1616	369	382	369
query14	12374	7235	7849	7235
query15	252	165	166	165
query16	7135	516	492	492
query17	1141	579	533	533
query18	1998	288	294	288
query19	289	172	165	165
query20	124	113	108	108
query21	213	109	101	101
query22	4547	4349	4333	4333
query23	34331	33414	33223	33223
query24	5579	2912	2918	2912
query25	515	381	368	368
query26	705	157	156	156
query27	1788	265	268	265
query28	3848	2008	2001	2001
query29	650	402	401	401
query30	235	148	144	144
query31	947	787	721	721
query32	98	53	59	53
query33	498	280	280	280
query34	862	495	478	478
query35	806	726	701	701
query36	1085	905	949	905
query37	130	78	80	78
query38	3927	3880	3767	3767
query39	1455	1372	1390	1372
query40	199	124	111	111
query41	44	43	44	43
query42	108	92	93	92
query43	495	481	460	460
query44	1066	728	722	722
query45	196	162	164	162
query46	1080	726	720	720
query47	1871	1759	1778	1759
query48	372	293	291	291
query49	751	406	421	406
query50	799	395	397	395
query51	6693	6673	6662	6662
query52	95	92	88	88
query53	253	182	181	181
query54	560	434	457	434
query55	74	71	75	71
query56	257	240	237	237
query57	1126	1074	1019	1019
query58	244	223	222	222
query59	3002	2818	2969	2818
query60	297	256	257	256
query61	107	93	94	93
query62	747	626	641	626
query63	210	180	179	179
query64	9140	2278	7529	2278
query65	3238	3210	3144	3144
query66	676	356	332	332
query67	15171	14740	14785	14740
query68	5756	539	537	537
query69	415	272	275	272
query70	1214	1052	1109	1052
query71	442	275	277	275
query72	6939	2306	2003	2003
query73	744	323	318	318
query74	9305	8701	8738	8701
query75	3371	2712	2677	2677
query76	2859	967	998	967
query77	558	309	299	299
query78	9773	9121	9238	9121
query79	2146	523	530	523
query80	1299	483	479	479
query81	553	220	231	220
query82	685	133	132	132
query83	244	144	144	144
query84	265	79	75	75
query85	1268	279	270	270
query86	488	300	292	292
query87	4446	4190	4174	4174
query88	4240	2367	2384	2367
query89	398	288	278	278
query90	1834	189	189	189
query91	121	96	93	93
query92	65	50	50	50
query93	2659	526	532	526
query94	850	296	277	277
query95	360	260	257	257
query96	608	281	274	274
query97	3214	3047	3029	3029
query98	233	205	194	194
query99	1529	1262	1258	1258
Total cold run time: 323910 ms
Total hot run time: 189973 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 30.01 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit 4efbbf466def997cef1a12e655d959afd0049f3f, data reload: false

query1	0.05	0.04	0.04
query2	0.07	0.04	0.04
query3	0.23	0.06	0.06
query4	1.66	0.09	0.10
query5	0.48	0.50	0.50
query6	1.13	0.73	0.72
query7	0.02	0.02	0.01
query8	0.05	0.05	0.05
query9	0.54	0.48	0.49
query10	0.54	0.55	0.53
query11	0.15	0.12	0.12
query12	0.16	0.12	0.12
query13	0.63	0.61	0.59
query14	0.76	0.77	0.80
query15	0.84	0.82	0.82
query16	0.36	0.37	0.37
query17	1.05	0.99	1.02
query18	0.23	0.22	0.22
query19	1.89	1.69	1.72
query20	0.01	0.01	0.01
query21	15.40	0.76	0.66
query22	5.20	7.06	1.19
query23	18.30	1.41	1.29
query24	2.09	0.22	0.22
query25	0.15	0.09	0.09
query26	0.29	0.21	0.22
query27	0.46	0.23	0.22
query28	13.36	1.03	1.00
query29	12.66	3.38	3.32
query30	0.24	0.05	0.04
query31	2.89	0.40	0.40
query32	3.26	0.50	0.47
query33	2.94	2.97	2.96
query34	17.15	4.32	4.32
query35	4.46	4.40	4.37
query36	0.65	0.47	0.49
query37	0.19	0.15	0.16
query38	0.16	0.14	0.14
query39	0.04	0.03	0.04
query40	0.15	0.13	0.12
query41	0.10	0.04	0.05
query42	0.06	0.05	0.04
query43	0.04	0.04	0.04
Total cold run time: 111.09 s
Total hot run time: 30.01 s

@doris-robot
Copy link

TPC-H: Total hot run time: 37425 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 1c30704f25542a0420d1fd831164d9fedf0efb7e, data reload: false

------ Round 1 ----------------------------------
q1	18310	4354	4263	4263
q2	2039	190	169	169
q3	10513	1128	1148	1128
q4	10160	725	726	725
q5	7729	2811	2710	2710
q6	223	136	135	135
q7	956	594	597	594
q8	9330	2033	2052	2033
q9	7122	6535	6520	6520
q10	7060	2208	2249	2208
q11	448	241	232	232
q12	392	218	227	218
q13	18642	2980	2953	2953
q14	270	235	233	233
q15	531	472	477	472
q16	517	386	382	382
q17	975	728	725	725
q18	7708	6904	6632	6632
q19	5606	971	995	971
q20	633	321	339	321
q21	3802	2949	2784	2784
q22	1094	1053	1017	1017
Total cold run time: 114060 ms
Total hot run time: 37425 ms

----- Round 2, with runtime_filter_mode=off -----
q1	4470	4219	4264	4219
q2	387	275	278	275
q3	2860	2584	2634	2584
q4	1855	1577	1593	1577
q5	5400	5608	5594	5594
q6	223	129	131	129
q7	2178	1777	1702	1702
q8	3249	3409	3446	3409
q9	8745	8784	8890	8784
q10	3531	3315	3224	3224
q11	610	500	543	500
q12	814	623	599	599
q13	16861	2984	3170	2984
q14	329	283	300	283
q15	523	479	498	479
q16	493	430	444	430
q17	1826	1529	1500	1500
q18	8094	7999	7292	7292
q19	1680	1442	1539	1442
q20	2044	1762	1788	1762
q21	5228	5144	5094	5094
q22	1122	999	977	977
Total cold run time: 72522 ms
Total hot run time: 54839 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 183972 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 1c30704f25542a0420d1fd831164d9fedf0efb7e, data reload: false

query1	890	381	366	366
query2	6464	1921	1802	1802
query3	6631	220	215	215
query4	30572	23102	23001	23001
query5	4184	490	501	490
query6	274	172	173	172
query7	4630	287	295	287
query8	258	195	208	195
query9	8482	2434	2393	2393
query10	410	250	250	250
query11	17775	14910	14956	14910
query12	146	96	96	96
query13	1599	352	354	352
query14	9065	6847	6764	6764
query15	213	167	165	165
query16	7654	480	480	480
query17	1340	548	530	530
query18	1859	280	272	272
query19	190	141	137	137
query20	114	105	104	104
query21	201	101	94	94
query22	4174	4058	4023	4023
query23	33832	33269	33068	33068
query24	11848	2818	2759	2759
query25	667	369	360	360
query26	1723	149	149	149
query27	2854	263	264	263
query28	7356	2042	2029	2029
query29	1036	402	411	402
query30	306	158	152	152
query31	964	749	747	747
query32	128	52	53	52
query33	737	283	275	275
query34	948	446	457	446
query35	834	728	728	728
query36	1047	902	923	902
query37	159	79	79	79
query38	3927	3801	3799	3799
query39	1441	1384	1395	1384
query40	282	115	109	109
query41	47	44	45	44
query42	118	93	97	93
query43	505	468	463	463
query44	1176	721	721	721
query45	192	166	164	164
query46	1107	770	714	714
query47	1843	1797	1764	1764
query48	351	290	277	277
query49	1086	405	401	401
query50	789	397	409	397
query51	6798	6762	6670	6670
query52	95	88	87	87
query53	276	185	186	185
query54	877	445	440	440
query55	75	73	75	73
query56	257	251	262	251
query57	1122	1067	1043	1043
query58	225	225	218	218
query59	2864	2732	2811	2732
query60	302	266	256	256
query61	94	94	95	94
query62	833	636	643	636
query63	217	178	189	178
query64	6340	2248	1734	1734
query65	3185	3178	3158	3158
query66	1333	337	327	327
query67	15547	14920	14989	14920
query68	4510	525	536	525
query69	391	264	292	264
query70	1127	1139	1124	1124
query71	397	272	270	270
query72	6471	2233	1994	1994
query73	744	312	360	312
query74	9073	8884	8778	8778
query75	3433	2619	2676	2619
query76	2587	1015	1012	1012
query77	462	309	296	296
query78	9535	9056	8971	8971
query79	2479	523	519	519
query80	2035	475	470	470
query81	587	230	219	219
query82	795	138	129	129
query83	278	149	143	143
query84	271	82	79	79
query85	2049	270	259	259
query86	526	294	313	294
query87	4335	4289	4174	4174
query88	4374	2270	2263	2263
query89	388	290	289	289
query90	1903	193	196	193
query91	119	95	95	95
query92	65	47	48	47
query93	1788	531	531	531
query94	959	288	280	280
query95	348	250	250	250
query96	596	262	267	262
query97	3208	3067	3023	3023
query98	217	205	206	205
query99	1631	1242	1244	1242
Total cold run time: 292549 ms
Total hot run time: 183972 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 30.99 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit 1c30704f25542a0420d1fd831164d9fedf0efb7e, data reload: false

query1	0.05	0.04	0.04
query2	0.08	0.04	0.04
query3	0.23	0.05	0.05
query4	1.67	0.07	0.07
query5	0.49	0.50	0.52
query6	1.15	0.73	0.72
query7	0.01	0.02	0.01
query8	0.05	0.04	0.05
query9	0.54	0.50	0.49
query10	0.56	0.54	0.55
query11	0.15	0.12	0.12
query12	0.14	0.12	0.12
query13	0.60	0.61	0.59
query14	0.76	0.78	0.78
query15	0.83	0.82	0.80
query16	0.35	0.38	0.37
query17	1.05	1.02	1.01
query18	0.22	0.22	0.21
query19	1.84	1.73	1.72
query20	0.01	0.01	0.00
query21	15.41	0.73	0.65
query22	4.21	6.68	2.21
query23	18.29	1.39	1.26
query24	2.06	0.23	0.22
query25	0.16	0.08	0.09
query26	0.30	0.21	0.20
query27	0.45	0.24	0.22
query28	13.29	1.01	1.01
query29	12.62	3.36	3.32
query30	0.24	0.05	0.05
query31	2.89	0.39	0.38
query32	3.27	0.50	0.48
query33	2.91	2.98	2.93
query34	17.12	4.32	4.37
query35	4.36	4.45	4.40
query36	0.66	0.48	0.47
query37	0.19	0.16	0.16
query38	0.16	0.15	0.15
query39	0.05	0.03	0.04
query40	0.15	0.12	0.12
query41	0.08	0.04	0.05
query42	0.06	0.04	0.04
query43	0.05	0.03	0.04
Total cold run time: 109.76 s
Total hot run time: 30.99 s

@DarvenDuan DarvenDuan force-pushed the fix_schema_change_bug branch 2 times, most recently from 816efb6 to 7c7bda4 Compare August 16, 2024 05:31
@DarvenDuan DarvenDuan force-pushed the fix_schema_change_bug branch from 7c7bda4 to 398082d Compare October 11, 2024 02:31
@DarvenDuan
Copy link
Contributor Author

run buildall

Copy link
Collaborator

@TangSiyang2001 TangSiyang2001 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants