Multimodal detection framework for financial fraud integrating LLMs and interpretable machine learning
, , y
01 sept 2025
Acerca de este artículo
Categoría del artículo: Research Papers
Publicado en línea: 01 sept 2025
Recibido: 17 abr 2025
Aceptado: 18 ago 2025
DOI: https://doi.org/10.2478/jdis-2025-0046
Palabras clave
© 2025 Hui Nie et al., published by Sciendo
This work is licensed under the Creative Commons Attribution 4.0 International License.
Figure 1.

Figure 2.

Figure 3.

Figure 4.

Figure 5.

Figure 6.

Corporate Governance Indicators (11 Items)_
Indicator Type | Indicator |
Variable Definition and Value |
---|---|---|
Ownership |
First Largest Shareholder Ownership Ratio |
|
Internal |
Whether serving as both Chairman and CEO (l=Yes, 0=No) |
Annual report feature indicators_
Indicator Type | Indicator Variable | Variable Definition |
---|---|---|
Tone | LM_tone |
Tone value of the full annual report, calculated based on the LM dictionary. |
Semantic |
Semantic feature vector extracted from the MD&A ( |
|
Readability | Character count of the annual report’s MD&A section |
Ablation analysis on multi-feature fusion_
Features | AUC | ΔAUC | Accuracy | ΔAccuracy | F1 | ΔF1 |
---|---|---|---|---|---|---|
Financial | 0.820 | - | 0.740 | - | 0.694 | - |
Financial + Textual | 0.880 | 0.798 | +5.8% | 0.780 | + |
|
Financial + Textual |
Experimental results of word embedding models (Input Dimension 256)_
Embedding model | Introduction | Accuracy | AUC | F1 |
---|---|---|---|---|
A high-performance text embedding model developed by ByteDance’s Volcano Engine, part of the Doubao large model ecosystem, specializing in generating high-quality semantic vectors. | 0.742 | 0.817 | 0.733 | |
Baidu’s general-purpose text embedding model, incorporating the company’s NLP expertise. It supports Chinese and multilingual tasks, generating high-dimensional dense vectors (1024 dimensions). | 0.738 | 0.812 | 0.715 | |
An open-source Chinese text embedding model from Beijing Academy of Artificial Intelligence (BAAI). Fine-tuned on RoBERTa-large architecture, it delivers excellent performance on Chinese text. | 0.739 | 0.802 | 0.709 | |
A classic open-source Chinese text embedding model based on transformer architecture, providing static contextual embeddings. | 0.609 | 0.642 | 0.577 |
Sample summaries of corporate annual reports_
Chinese | English |
---|---|
藏格控股 2019 年年度报告显示, 管理层围绕年度目标努力基本完成任务。当年钾肥生产 108.29 万吨, 营收 20.18 亿元, 净利润 3.08 亿元, 业绩下降因产品销量、价格、产量及成本、存货跌价等因素。 | The 2019 annual report of Zangge Holdings indicates that the management team largely accomplished its annual objectives. Potassium fertilizer production reached 1.0829 million tons, with revenue of 2.018 billion yuan and net profit of 308 million yuan. The decline in performance was attributed to factors such as product sales volume, price, production volume, costs, and inventory depreciation. |
报告期主要工作包括推进电池级碳酸锂项目投产、改造技术工艺降成本、坚持创新发展获补助资金。多项财务数据有变动, 如营业收入合计同比减 24.70%, 销售、管理、财务、研发费用有不同变化。 | Key initiatives during the reporting period included advancing the commissioning of the battery-grade lithium carbonate project, improving technical processes to reduce costs, and securing subsidy funds through innovation-driven development. Several financial metrics showed changes, such as a 24.70% year-on-year decrease in total operating revenue, with variations in sales, management, financial, and R&D expenses. |
研发投入大幅增长, 现金流方面各活动现金流量有升降。非主营业务投资收益为负, 还涉及资产减值、营业外收支等。资产及负债有比重增减, 获取重大股权投资, 收购西藏巨龙铜业 37%股权, 本期投资亏损 1.68 亿元。 | R&D investment increased significantly, while cash flows from various activities fluctuated. Non-core business investment returns were negative, and the report also covered asset impairments, non-operating income, and expenses. Assets and liabilities experienced shifts in proportions, with significant equity investments, including the acquisition of a 37% stake in Xizang Julong Copper for a loss of 168 million yuan during the period. |
募集资金使用有进展, 曾自筹资金投入募投项目, 2019 年募投项目结项节余补流。公司积累了经验和技术储备, 目前面临钾肥价格、新业务开拓、环保等风险,将采取相应措施应对。还存在股份质押、业绩补偿股份回购注销风险, 补偿义务人正积极处置。 | Progress was made in the use of raised funds, with self-raised capital allocated to fundraising projects. In 2019, surplus funds from completed fundraising projects were redirected to working capital. The company has accumulated experience and technical reserves but faces risks such as potassium fertilizer price fluctuations, new business expansion, and environmental challenges, for which corresponding measures will be implemented. Risks related to share pledges, performance compensation share repurchases, and cancellations remain, with obligors actively addressing them. |
未来公司将在稳步发展氯化钾基础上开发高附加值产品, 利用老卤生产新能源材料, 开发锂系列产品, 加强技术和人才储备等。2019 年多次通过电话沟通方式接待个人咨询, 内容涉及公司多个方面情况, 多数未提供书面报告。 | Moving forward, the company will focus on developing high-value-added products based on stable potassium chloride production, utilizing brine for new energy materials, expanding lithium-based products, and strengthening technical and talent reserves. In 2019, the company responded to numerous individual inquiries via phone consultations, covering various aspects of the company, most of which did not result in written reports. |
Performance comparison of financial fraud prediction models with multi-dimensional embeddings_
Semantic features | Dimension | Accuracy | Precision | Recall | F1 | AUC |
---|---|---|---|---|---|---|
Embedding vectors | 1024 | 0.732 | 0.715 | 0.693 | 0.704 | 0.811 |
512 | 0.732 | 0.706 | 0.714 | 0.710 | 0.805 | |
128 | 0.695 | 0.656 | 0.705 | 0.679 | 0.762 |
Model performance under imbalanced data_
Fraud to Non-Fraud Sample Ratio | Accuracy | AUC | F1 | Precision | Recall |
---|---|---|---|---|---|
1:1 | 0.742 | 0.817 | 0.713 | 0.713 | 0.733 |
1:2 | 0.732 | 0.770 | 0.568 | 0.564 | 0.571 |
1:5 | 0.856 | 0.755 | 0.470 | 0.511 | 0.430 |
1:10 | 0.819 | 0.613 | 0.222 | 0.273 | 0.188 |
1:20 | 0.946 | 0.586 | 0.167 | 0.273 | 0.120 |
Comparative performance of ML algorithms in fraud setection task_
ML Algorithms | AUC | Accuracy | F1 |
---|---|---|---|
LR | 0.750 | 0.685 | 0.679 |
LightGBM | 0.868 | 0.791 | 0.787 |
CatBoost | 0.852 | 0.776 | 0.773 |
Impact of textural features: Ablation analysis_
Features | AUC | ΔAUC | Accuracy | ΔAccuracy | F1 | ΔF1 |
---|---|---|---|---|---|---|
Financial | 0.820 | - | 0.740 | - | 0.694 | - |
Financial+Tone | 0.828 | +0.8% | 0.749 | +0. 9% | 0.727 | +3.3% |
Financial+Tone+Readability | 0.841 | +1.3% | 0.766 | +1.7% | 0.745 | +1.8% |
Financial+Tone+Readability+Semantics |
The ML models’ configuration for fraud detection_
Algorithms | Parameters | Value | Meaning |
---|---|---|---|
Logistic |
C |
100 |
The reciprocal of regularization strength |
XGBoost | subsample |
0.899 |
The sample proportion per tree during training |
LightGBM | colsample_bytree |
0.6 |
Feature proportion per tree |
CatBoost | subsample |
0.8 |
The proportion of samples for training each tree |
Financial Indicators (19 Items)_
Indicator Type | Indicator Variable | Variable Definition and Operation |
---|---|---|
Solvency | Asset Liability Ratio; Total Liabilities / Total Assets |
|
Profitability | Return on Equity (ROE); Net Profit / Total Shareholders’ Equity |
|
Growth Potential | Total Assets Growth Rate; (Current Period Adjusted Figure - Prior Year Same Period Adjusted Figure)/Prior Year Same Period Adjusted Figure for ABS |
|
Operational Efficiency | Accounts Receivable Turnover, Revenue/((Beginning Net Accounts Receivable + Ending Net Accounts Receivable)/2) |
|
Dash Flow Adequacy | Net operating cash flow / Total Revenue |
Corporate fraud type frequency and incidence statistics_
Violation Type | Description | Count | Frequency |
---|---|---|---|
Delayed disclosure | Failure to disclose material information within the prescribed timeframe, violating the timely disclosure requirements under securities regulations. | 72 | 16% |
False statement | Intentional misrepresentation or falsification of financial statements, reports, or other disclosures, including misleading statements that distort the true financial condition or operational performance of the company. | 57 | 13% |
Significant omissions | Deliberate or negligent failure to disclose material information that could significantly impact investors’ decisions, thereby violating the principle of completeness in disclosure. | 51 | 11% |
Inflated assets | Misstating the value or existence of assets in financial statements, including overstating asset values or recording non-existent assets. | 28 | 6% |
Fabricated profits | Artificially inflating or fabricating profits through improper accounting practices, such as recognizing revenue prematurely or manipulating expense records. | 9 | 2% |
Guarantee breaches | Referring to a listed company or its affiliates providing guarantees in violation of regulations or beyond authorized limits, harming the interests of the company and investors. | 8 | 2% |
Misleading disclosures | Providing inaccurate, incomplete, or misleading information in disclosures, whether in financial reports, announcements, or other regulatory filings, which misleads investors or regulators. | 5 | 1% |
Others | - | 78 | 17% |
Semantic feature extraction based on the Doubao LLMs_
LLM | Prompt | Input | Output |
---|---|---|---|
You are an outstanding natural language processing assistant. Please summarize the input text, ensuring that no critical information is lost or misrepresented. The summary should be fluent, logically clear, and complete and not exceed 256 characters. | Chunked text from MA&A (Original text) | Chunked text summary | |
You are an outstanding natural language processing assistant. Please summarize the input text, ensuring that no critical information is lost or misrepresented. The summary should be fluent, logically clear, and complete and not exceed 1024 characters. | Joined summaries of chunked text | Full-text summary | |
None | Full-text summary | Embedding Vector |