基于依存关系图注意力网络的SQL生成方法

doi:10.3785/j.issn.1008-973X.2024.05.004

浙江大学学报(工学版)

2024, Vol. 58

Issue (5): 908-917 DOI: 10.3785/j.issn.1008-973X.2024.05.004

计算机技术、通信技术

基于依存关系图注意力网络的SQL生成方法

舒晴1,2(

),刘喜平1,*(

),谭钊1,李希1,万常选1,刘德喜1,廖国琼1

1. 江西财经大学信息管理学院，江西南昌 330013
2. 江西农业大学软件学院，江西南昌 330013

SQL generation method based on dependency relational graphattention network

Qing SHU1,2(

),Xiping LIU1,*(

),Zhao TAN1,Xi LI1,Changxuan WAN1,Dexi LIU1,Guoqiong LIAO1

1. School of Information Technology, Jiangxi University of Finance and Economics, Nanchang 330013, China
2. School of Software, Jiangxi Agricultural University, Nanchang 330013, China

全文: PDF(871 KB) HTML

摘要：

研究基于自然语言问题的结构化查询语言(SQL)生成问题(Text-to-SQL). 提出两阶段框架，旨在解耦模式链接和SQL生成过程，降低SQL生成的难度. 第1阶段通过基于关系图注意力网络的模式链接器识别问题中提及的数据库表、列和值，利用问题的语法结构和数据库模式项之间的内部关系，指导模型学习问题与数据库的对齐关系. 构建问题图时，针对Text-to-SQL任务的特点，在原始句法依存树的基础上，合并与模式链接无关的关系，添加并列结构中的从属词与句中其他成分间的依存关系，帮助模型捕获长距离依赖关系. 第2阶段进行SQL生成，将对齐信息注入T5的编码器，对T5进行微调. 在Spider、Spider-DK和Spider-Syn数据集上进行实验，实验结果显示，该方法具有良好的性能，尤其是对中等难度以上的Text-to-SQL问题具有良好的表现.

关键词： Text-to-SQL; 自然语言查询; 依存句法分析; 关系图注意力网络

Abstract:

The problem of generating structured query language (SQL) from natural language questions (Text-to-SQL) was analyzed. A two-stage framework was proposed to decouple the processes of schema linking and SQL generation in order to reduce the complexity of SQL generation. Database tables, columns, and values mentioned in the question were identified by a schema linker based on relational graph attention network in the first stage. The syntactic structure of the question and the internal relationships between database schema items were used to guide the model in learning the alignment between the question and the database. The original syntactic dependency tree was modified by merging relationships irrelevant to schema linking and adding dependencies between subordinating conjunctions in parallel structures and other elements in the sentence in view of the characteristics of Text-to-SQL task when constructing the question graph, which helps the model capture long-distance dependencies. SQL generation was performed by injecting the alignment information into the T5 encoder and fine-tuning it in the second stage. Experiments were conducted on the Spider, Spider-DK and Spider-Syn datasets. Results showed that the method performed well, especially for Text-to-SQL problems of medium difficulty and above.

Key words: Text-to-SQL natural language query dependency parsing relational graph attention network

收稿日期: 2023-07-03 出版日期: 2024-04-26

CLC:

TP 393

基金资助: 国家自然科学基金资助项目(62076112, 62272205, 62272206, 62272207)；江西省自然科学基金资助项目（20232ACB202008）；江西省教育厅科学技术研究项目（GJJ190255）；江西省研究生创新专项资助项目（YC2023-B185）.

通讯作者: 刘喜平 E-mail: shu_0124@163.com;lewislxp@gmail.com

作者简介: 舒晴（1990—），女，博士生，从事自然语言处理的研究. orcid.org/0009-0007-0365-7621. E-mail：shu_0124@163.com

	服务
	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	作者相关文章
	舒晴
	刘喜平
	谭钊
	李希
	万常选
	刘德喜
	廖国琼

引用本文:

舒晴,刘喜平,谭钊,李希,万常选,刘德喜,廖国琼. 基于依存关系图注意力网络的SQL生成方法[J]. 浙江大学学报(工学版), 2024, 58(5): 908-917.

Qing SHU,Xiping LIU,Zhao TAN,Xi LI,Changxuan WAN,Dexi LIU,Guoqiong LIAO. SQL generation method based on dependency relational graphattention network. Journal of ZheJiang University (Engineering Science), 2024, 58(5): 908-917.

链接本文:

https://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2024.05.004 或 https://www.zjujournals.com/eng/CN/Y2024/V58/I5/908

图 1 基于依存关系图注意力网络的两阶段SQL生成框架

图 2 模式链接器的结构

表 1 保留的依存关系集合

图 3 依存关系修改示例

表 2 模式项之间的关系

表 3 模式链接的结果

表 4 部分样本模型预测结果的对比

表 5 模式链接的消融实验结果

表 6 列预测结果的对比

表 7 输入序列的对比

表 8 SQL生成实验结果的对比

表 9 不同难度上的EX结果对比

表 10 Spider-DK和Spider-Syn上的SQL生成结果

1	刘喜平, 舒晴, 何佳壕, 等基于自然语言的数据库查询生成研究综述[J]. 软件学报, 2022, 33 (11): 4107- 4136 LIU Xiping, SHU Qing, HE Jiahao, et al Survey on generating database queries based on natural language[J]. Journal of Software, 2022, 33 (11): 4107- 4136
2	GUO J, ZHAN Z, GAO Y, et al. Towards complex text-to-SQL in cross-domain database with intermediate representation [C]// Proceedings of ACL . Florence: ACL, 2019: 4524-4535.
3	WANG K, SHEN W, YANG Y, et al. Relational graph attention network for aspect-based sentiment analysis [C]// Proceedings of ACL . [S. l. ]: ACL, 2020: 3229-3238.
4	XU X, LIU C, SONG D X. SQLNet: generating structured queries from natural language without reinforcement learning [EB/OL]. (2017-11-13) [2023-10-27]. https://arxiv.org/abs/1711.04436.
5	YU T, LI Z, ZHANG Z, et al. TypeSQL: knowledge-based type-aware neural text-to-SQL generation [C]// Proceedings of NAACL-HLT . New Orleans: ACL, 2018: 588-594.
6	LEI W, WANG W, MA Z, et al. Re-examining the role of schema linking in text-to-SQL [C]// Proceedings of EMNLP . [S. l.]: ACL, 2020: 6943-6954.
7	CHOI D, SHIN M C, KIM E, et al RYANSQL: recursively applying sketch-based slot fillings for complex text-to-SQL in cross-domain databases[J]. Computational Linguistics, 2021, 47 (2): 309- 332
8	BOGIN B, BERANT J, GARDNER M. Representing schema structure with graph neural networks for text-to-SQL parsing [C]// Proceedings of ACL . Florence: ACL, 2019: 4560-4565.
9	BOGIN B, GARDNER M, BERANT J. Global reasoning over database structures for text-to-SQL parsing [C]// Proceedings of EMNLP-IJCNLP 2019 . Hong Kong: ACL, 2019: 3659-3664.
10	ZHONG V, XIONG C, SOCHER R. Seq2SQL: generating structured queries from natural language using reinforcement learning [EB/OL]. (2017-11-09) [2023-10-27]. https://arxiv.org/abs/1709.00103.
11	WANG B, SHIN R, LIU X, et al. RAT-SQL: relation-aware schema encoding and linking for text-to-SQL parsers [C]// Proceedings of ACL . [S. l. ]: ACL, 2020: 7567-7578.
12	VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need [C]// Advances in Neural Information Processing Systems . Long Beach: MIT Press, 2017: 5998–6008.
13	LIN X V, SOCHER R, XIONG C. Bridging textual and tabular data for cross-domain text-to-SQL semantic parsing [C]// Findings of the ACL: EMNLP 2020 . [S. l.]: ACL, 2020: 4870-4888.
14	CAO R, CHEN L, CHEN Z, et al. Line graph enhanced text-to-SQL model with mixed local and non-local relations [C]// Proceedings of ACL-IJCNLP 2021 . [S. l.]: ACL, 2021: 2541-2555.
15	CAI R, YUAN J, XU B, et al Sadga: Structure-aware dual graph aggregation network for text-to-sql[J]. Advances in Neural Information Processing Systems, 2021, 34: 7664- 7676
16	HUI B, GENG R, WANG L, et al. S2SQL: injecting syntax to question-schema interaction graph encoder for Text-to-SQL parsers [C]// Findings of the ACL: ACL 2022 . Dublin: ACL, 2022: 1254-1262.
17	QI J, TANG J, HE Z, et al. Rasat: Integrating relational structures into pretrained seq2seq model for text-to-sql [C]// Proceedings of the 2022 Conference on EMNLP . Abu Dhabi: ACL, 2022: 3215-3229.
18	RAFFEL C, SHAZEER N, ROBERTS A, et al Exploring the limits of transfer learning with a unified text-to-text transformer[J]. The Journal of Machine Learning Research, 2020, 21 (1): 5485- 5551
19	SHAW P, CHANG M W, PASUPAT P, et al. Compositional generalization and natural language variation: can a semantic parsing approach handle both? [C]// Proceedings of ACL-IJCNLP 2021 . [S. l. ]: ACL, 2021: 922–938.
20	SCHOLAK T, SCHUCHER N, BAHDANAU D. PICARD: parsing incrementally for constrained auto-regressive decoding from language models [C]// Proceedings of the 2021 Conference on EMNLP . Punta Cana: ACL, 2021: 9895-9901.
21	GAO C, LI B, ZHANG W, et al. Towards generalizable and robust Text-to-SQL parsing [C]// Findings of the ACL: EMNLP 2022 . Abu Dhabi: ACL, 2022: 2113-2125.
22	XIE T, WU C H, SHI P, et al. Unifiedskg: Unifying and multi-tasking structured knowledge grounding with text-to-text language models [C]// Proceedings of the 2022 Conference on EMNLP . Abu Dhabi: ACL, 2022: 602-631.
23	LI J, HUI B, CHENG R, et al. Graphix-T5: mixing pre-trained Transformers with graph-aware layers for text-to-SQL parsing [EB/OL]. (2023-01-18) [2023-10-27]. https://arxiv.org/abs/2301.07507.
24	DEVLIN J, CHANG M W, LEE K, et al. BERT: pre-training of deep bidirectional transformers for language understanding [C]// Proceedings of NAACL-HLT 2019 . Minneapolis: ACL, 2019: 4171–4186.
25	YU T, ZHANG R, YANG K, et al. Spider: a large-scale human-labeled dataset for complex and cross-domain semantic parsing and Text-to-SQL task [C]// Proceedings of the 2018 Conference on EMNLP . Brussels: ACL, 2018: 3911–3921.
26	GAN Y, CHEN X, PURVER M. Exploring underexplored limitations of cross-domain Text-to-SQL generalization [C]// Proceedings of EMNLP . [S. l. ]: ACL, 2021: 8926–8931.
27	GAN Y, CHEN X, HUANG Q, et al. Towards robustness of Text-to-SQL models against synonym substitution [C]// Proceedings of ACL-IJCNLP. [S. l. ]: ACL , 2021: 2505-2515.
28	SHAZEER N, STERN M. Adafactor: adaptive learning rates with sublinear memory cost [C]// Proceedings of the 35th International Conference on Machine Learning , Stockholm: PMLR, 2018: 4596-4604.

[1]	何佳壕,刘喜平,舒晴,万常选,刘德喜,廖国琼. 带复杂计算的金融领域自然语言查询的SQL生成[J]. 浙江大学学报(工学版), 2023, 57(2): 277-286.

Viewed

Full text

Abstract

Cited

Shared

Discussed