In order to improve the detection and filtering ability for financial data, a data-filtering method based on mathematical probability statistical model, a descriptive statistical analysis model of big data filtering, probability density characteristic statistical design data filtering analysis combined with fuzzy mathematical reasoning, regression analysis according to probability density of financial data distribution, and threshold test and threshold judgment are conducted to realize data filtering. The test results show that the big data filtering and the reliability and convergence of the mathematical model are optimal.
Keywords
- mathematical probability model
- data filtering
- financial data
- inspection statistics
AMS 2010 codes
- 91-04
As the size of the big data information is increasing, big data containing different types of information require effective filtration and separation of different big data, improvement in the storage and management capabilities of data, optimization of the storage space of data, and improvement in large data information identification. Ability research on information classification methods and filtration methods for big data is of great significance in large database construction and cloud storage model design.
The filter model for data is based on mathematical statistical analysis and applies the statistical regression analysis model of data filtration. Using probability distribution and adaptive fuzzy clustering analysis methods, we optimize the filtration of data. Typical large data filtration models mainly include quantitative regression analysis model, autoregressive moving average (ARMA) model, correlation statistical analysis model, Backlund model etc. Based on the design of the probability mathematical model, combined with the pattern identification and the characteristic clustering method, data filtering is done; but the traditional method has a large interference with mode identification during data filtration. In response to the above problem, this paper proposes a data filtration method based on the probability mathematical model, and the statistical characteristic analysis model for large data filtration is constructed, combined with mathematical modeling and statistical analysis methods, and the data filter probability mathematical model is optimized. Finally, data analysis is performed to obtain conclusions regarding the effectiveness of the model [1, 2].
Financial data are typical time series data, which contain financial data, exchange rate interest rates, futures prices, etc. and various forms of one-dimensional or multidimensional data, which are characterized by nonlinearity of dynamics. Analysis of financial data is not only a hot problem of social hotspots but also a subject of academic community research. For decades, a large number of scholars have made great contributions to the development of the study, whether economics, statistics, or computer science. There are many kinds of financial data, such as stock, futures, exchange rates, etc., characterized by nonlinear dynamic changes, caused also because of the social development and economic policies of various countries. Correspondingly, a variety of research methods are more likely. Data are endless, and when they are financial, statistical or computer scholars hope to establish a relatively reliable model through a large amount of financial data to predict and analyze financial markets, thereby reducing investors’ mistakes in transaction decision-making, avoiding operational risks.
At the same time, these models explore the internal law of the financial market, providing an efficient solution for managers and decision-makers in the financial market to improve the market economy system. The statistical model of real-world data usually involves continuous probability distribution, such as normal, Laplace, or index distribution. This distribution is supported by many probabilistic modeling forms, including probability database systems. However, the traditional theoretical framework of the probability database is completely concerned with the limited probability database [3]. Yue et al. [4] developed mathematical theories in unlimited probability databases. In a previous paper (GROHE, LINDNER; ICDT 2020), a very general framework is proposed for the probability database that may involve a continuous probability distribution; the query has a clear definition in this framework. Extending the probability programming language to generate data records (PODS 2020) to achieve a continuous probability distribution and displaying such programs generates a continuous probability database generation model [4]. Zhang [5] has developed a new way to have a nonprogrammetry of boundaries to accurately surround all uncertain parameter data extracted in the actual project. This method is based on conventional statistical methods and related analysis techniques. First, the average value and the correlation coefficient of the uncertain parameter are calculated by using all given data.
Then, first, a simple but effective optimization process is introduced in the mathematical modeling process to induce the uncertain parameters to obtain their exact boundaries. This process operates on all of the given data by optimizing the convex mode. Thus, an effective mathematical expression of the final formulation of the convex mode is obtained. In order to test the predictive capacity and generalization ability of the proposed convex mode, the evaluation criteria used are volume ratio, standard volume ratio, and prediction accuracy. The performance of the proposed method is studied by testing standards and comparing with other existing competition methods. The results prove the effectiveness and efficiency of the method [5]. Herrera-Vega et al. [6] propose a probabilistic hesitant financial data (PHFD)-based portfolio selection and risk portfolio selection model, i.e., probabilistic hesitant portfolio selection (PHPS) model and risk PHPS (RPHPS) model, of risk probability hesitation.
In addition, investment decision methods are provided to show their practical applications in the financial market. It is expected that the PHPS model of general investors is built based on the highest score or minimum deviation principle to obtain the best investment ratio, and the RPHPS model provides the best investment ratio for the best return or to undertake risk. Finally, an empirical study based on the actual data of China’s stock market is shown in detail. The results verify the effectiveness and practicality of the proposed method [6]. Li et al. used mathematical statistical methods to study the effectiveness of single-stage and multistage hydraulic fracturing technology. These studies are based on the geological and physical properties of the circulation of the prime wall, which makes it possible to demonstrate the choice of hydraulic fracturing technology.
Based on the probability and statistical analysis of the field efficiency data, quantitative standards selected by the crack parameters of propanes by length, height, and the injection of propane have been established. They can be used to predict the use of multistage and single-stage hydraulic fracturing methods. High-efficiency horizon AS12-3 has complex geological and physical structures and low filtering and storage performance, making it complicated, and contributes to the active formation of the reserves that are difficult to recover. Single-stage hydraulic fracturing technology is only 3–4 years old. Hydraulic fracturing technology with single-stage hydraulic fracturing technology is more effective [7].
The mathematical modeling analysis of data filtration is performed by the probability mathematical model. First, the sample descriptive statistical analysis method is used to analyze large data samples, the sampled major households are screened, and the statistical feature of the characteristic attribute that can effectively reflect the data filtration is determined. Sample regression analysis and adaptive processing on the sampling data are performed; the combined information fusion method implements the pattern identification of data filtration, and regression analysis is conducted according to the probability density of the detection statistic distribution in massive data environments; thus, data filtration according to the mode identification results. According to the above analysis, the overall structure of the probability mathematical model of data filtration is shown in Figure 1.
Fig. 1
Overall design of the probability mathematical model of data filtration.

According to the probability mathematical model shown in Figure 1, the mathematical information flow analysis of data filtration is performed, and the sample time series of the sampled data information stream is assumed.
Data filtration statistical analysis in Leslie–Gower space is performed. The associated property rule analysis method is used, and the algorithm for data filtration is designed. The feature extraction algorithm for data filtration is described as follows.
Enter: construct data filtering table with minimum multiplining method
Step 1:
Step 2:
Step 3:
Step 4:
Step 5:
Step 6:
Step 7:
Step 8:
Step 9:
Step 10: Fusion decisions are performed within a limited field
Step 11: Descriptive statistics matrix for output data filtration [8].
Looking for a series of factors affecting financial data change as an input of the HMM, i.e., observed sequences. The observed value belongs to the mixed Gaussian distribution; then:
where
For
where
On the basis of constructing a statistical characteristic analysis model for large data filtration, the optimization design of data filtering model is performed, and this paper proposes a data filtration method based on the probability mathematical model and gives data filtration probability density feature distribution mapping
In the formulas
The information entropy feature extraction is performed according to the association of the attribute. The scalar data time sequence attribute of data filtering is
Assuming that the decision threshold of
The probability mathematical model of data filtration is constructed using a fuzzy decision method, using a threshold test and threshold decision method, to achieve large data filtration, and the implementation process of the algorithm is as follows:
1. //Set the primary value of the probability mathematical model of data filtering
2.
3.
4. //Calculate data characteristics positioning matrix to get a fuzzy decision power index
5.
6.
7. else
8.
9. //With an independent main component analysis method, the fuzzy decision and probability statistics of data filtering are obtained, and the fuzzy identification matrix of data filtration output is obtained
10.
11.
12. //use SVD breakout operation method, power index statistics, execute the following cycle.
13.
14.
15. End
16. Feature statistic for data filtering output
17.
18. //End of iteration, realize the statistical regression analysis of large data filtration.
19.
20.
21. Seeking the main feature
22. If the convergence condition is met, the algorithm ends.
In order to test the application performance in data filtration, the data set of data sample distribution is database2016; the test data set is CSLOGS SET2, and the data sample number is
Classification attribute set of data filtration
53 | 2 | 2 | ||
3 | 6 | 3 | 3 | |
3 | 4 | 4 | 3 | |
53 | 6 | 3 | ||
3 | 2 | 5 | 4 | |
25 | 4 | 5 | ||
6 | 6 | 5 | 4 | |
6 | 5 | 2 | 3 | |
7 | 3 | 1 | 4 | |
5 | 4 | 1 | 5 |
According to the statistical rules of the large data sheet described above, the data are filtered, and data filtration is obtained as shown in Table 2.
Detection statistics rules table for data filtration
1 | 4 | * | 2 | 1 |
2 | 3 | * | 3 | 5 |
3 | 6 | * | 2 | 4 |
4 | * | 4 | 6 | 4 |
5 | 3 | * | 5 | 5 |
6 | 4 | 4 | * | 4 |
7 | 5 | * | 6 | 5 |
8 | 3 | 4 | 6 | 6 |
According to the statistical rule table, based on the above data filtration, data filtration is achieved, and the error rate of filtration is obtained, e.g., as shown in Figure 2; Figure 2 is analyzed and the accuracy of data filtration in this method is derived; the error is low.
Fig. 2
Performance comparison of data filtration.

Effective filtration and separation of different types of big data improve data storage and management capabilities, optimize data storage space, and improve large data information recognition capability. This paper proposes a data filtration method based on the probability mathematical model, using descriptive statistical analysis method, the statistical feature analysis model of large data filtration, combined with mathematical modeling and statistical analysis, implementation of data filter probability mathematical model, using threshold test and threshold decision methods to implement data filtering. Studies show that the method of data filtration is better and the error is low.
Fig. 1

Fig. 2

Fig. 1

Fig. 2

Detection statistics rules table for data filtration
1 | 4 | * | 2 | 1 |
2 | 3 | * | 3 | 5 |
3 | 6 | * | 2 | 4 |
4 | * | 4 | 6 | 4 |
5 | 3 | * | 5 | 5 |
6 | 4 | 4 | * | 4 |
7 | 5 | * | 6 | 5 |
8 | 3 | 4 | 6 | 6 |
Classification attribute set of data filtration
53 | 2 | 2 | ||
3 | 6 | 3 | 3 | |
3 | 4 | 4 | 3 | |
53 | 6 | 3 | ||
3 | 2 | 5 | 4 | |
25 | 4 | 5 | ||
6 | 6 | 5 | 4 | |
6 | 5 | 2 | 3 | |
7 | 3 | 1 | 4 | |
5 | 4 | 1 | 5 |
Law of interest rate changes in financial markets based on the differential equation model of liquidity Basalt fibre continuous reinforcement composite pavement reinforcement design based on finite element model Industrial transfer and regional economy coordination based on multiple regression model Response model for the psychological education of college students based on non-linear finite element equations Satisfactory consistency judgement and inconsistency adjustment of linguistic judgement matrix Analysis of the relationship between industrial agglomeration and regional economic growth based on the multi-objective optimisation model Constraint effect of enterprise productivity based on constrained form variational computing The impact of urban expansion in Beijing and Metropolitan Area urban heat Island from 1999 to 2019 Ultrasonic wave promoting ice melt in ice storage tank based on polynomial fitting calculation model Regarding new wave distributions of the non-linear integro-partial Ito differential and fifth-order integrable equations Badminton players’ trajectory under numerical calculation method Innovations to Attribute Reduction of Covering Decision System Based on Conditional Information Entropy Nonlinear Differential Equations in the Teaching Model of Educational Informatisation The evaluation of college students’ innovation and entrepreneurship ability based on nonlinear model Smart Communities to Reduce Earthquake Damage: A Case Study in Xinheyuan, China Institutional investor company social responsibility report and company performance Mathematical analysis of China's birth rate and research on the urgency of deepening the reform of art education First-principles calculations of magnetic and mechanical properties of Fe-based nanocrystalline alloy Fe80Si10Nb6B2Cu2 Has the belt and road initiative boosted the resident consumption in cities along the domestic route? – evidence from credit card consumption Attitude control for the rigid spacecraft with the improved extended state observer Cognitive Computational Model Using Machine Learning Algorithm in Artificial Intelligence Environment Research on tourism income index based on ordinary differential mathematical equation Application of Higher-Order Ordinary Differential Equation Model in Financial Investment Stock Price Forecast Sports Science Teaching of Athletics Based on Nonlinear Mathematical Equation Informatisation of educational reform based on fractional differential equations Research on the control of quantitative economic management variables under the numerical method based on stochastic ordinary differential equations Network monitoring and processing accuracy of big data acquisition based on mathematical model of fractional differential equation System dynamics model of output of ball mill Sensitivity Analysis of the Waterproof Performance of Elastic Rubber Gasket in Shield Tunnel Design of Morlet wavelet neural network to solve the non-linear influenza disease system Motion about equilibrium points in the Jupiter-Europa system with oblateness Badminton players’ trajectory under numerical calculation method Optimal preview repetitive control for impulse-free continuous-time descriptor systems Development of main functional modules for MVB and its application in rail transit Study on the impact of forest fire prevention policy on the health of forest resources Value Creation of Real Estate Company Spin-off Property Service Company Listing Selection by differential mortality rates Digital model creation and image meticulous processing based on variational partial differential equation The modelling and implementation of the virtual 3D animation scene based on the geometric centre-of-mass algorithm The policy efficiency evaluation of the Beijing–Tianjin–Hebei regional government guidance fund based on the entropy method The transfer of stylised artistic images in eye movement experiments based on fuzzy differential equations Research on behavioural differences in the processing of tenant listing information: An eye-movement experiment A review of the treatment techniques of VOC Some classes of complete permutation polynomials in the form of ( x p m −x +δ )s +ax p m +bx overF p 2m Deformation and stress theory of surrounding rock of shallow circular tunnel based on complex variable function method The consistency method of linguistic information and other four preference information in group decision-making Research on the willingness of Forest Land’s Management Rights transfer under the Beijing Forestry Development A mathematical model of the fractional differential method for structural design dynamics simulation of lower limb force movement step structure based on Sanda movement Fractal structure of magnetic island in tokamak plasma Numerical calculation and study of differential equations of muscle movement velocity based on martial articulation body ligament tension Study on the maximum value of flight distance based on the fractional differential equation for calculating the best path of shot put Sports intensity and energy consumption based on fractional linear regression equation Translog function in government development of low-carbon economy Analysis of the properties of matrix rank and the relationship between matrix rank and matrix operations Research on the Psychological Distribution Delay of Artificial Neural Network Based on the Analysis of Differential Equation by Inequality Expansion and Contraction Method Study on Establishment and Improvement Strategy of Aviation Equipment Research on Financial Risk Early Warning of Listed Companies Based on Stochastic Effect Mode Constructing Artistic Surface Modeling Design Based on Nonlinear Over-limit Interpolation Equation Numerical Simulation Analysis Mathematics of Fluid Mechanics for Semiconductor Circuit Breaker Characteristics of Mathematical Statistics Model of Student Emotion in College Physical Education Human Body Movement Coupling Model in Physical Education Class in the Educational Mathematical Equation of Reasonable Exercise Course The contribution of structural equation model analysis to higher education agglomeration and innovation and entrepreneurship Study on the evolutionary game theory of the psychological choice for online purchase of fresh produce under replicator dynamics formula The influence of X fuzzy mathematics method in basketball tactics scoring Mathematical statistics algorithm in the bending performance test of corroded reinforced concrete beams under fatigue load Nonlinear strategic human resource management based on organisational mathematical model Back propagation mathematical model for stock price prediction Evolutionary game research on the psychological choice of online shopping of fresh agricultural products based on dynamic simulation model Differential equation model of financial market stability based on big data Multi-attribute decision-making methods based on normal random variables in supply chain risk management Linear fractional differential equations in bank resource allocation and financial risk management model Construction and reform of art design teaching mode under the background of the integration of non-linear equations and the internet Spatial–temporal graph neural network based on node attention A contrastive study on the production of double vowels in Mandarin Financial accounting measurement model based on numerical analysis of rigid normal differential equation and rigid generalised functional equation Research of cascade averaging control in hydraulic equilibrium regulation of heating pipe network Mathematical analysis of civil litigation and empirical research of corporate governance Health monitoring of Bridges based on multifractal theory College students’ innovation and entrepreneurship ability based on nonlinear model Health status diagnosis of the bridges based on multi-fractal de-trend fluctuation analysis Mathematical simulation analysis of optimal testing of shot puter's throwing path Performance evaluation of college laboratories based on fusion of decision tree and BP neural network Application and risk assessment of the energy performance contracting model in energy conservation of public buildings The term structure of economic management rate under the parameter analysis of the estimation model based on common differential equation Sensitivity analysis of design parameters of envelope enclosure performance in the dry-hot and dry-cold areas The Spatial Form of Digital Nonlinear Landscape Architecture Design Based on Computer Big Data The improvement of museum information flow based on paste functional mapping method The art design of industrialised manufacturing furniture products based on the simulation of mathematical curves TOPSIS missile target selection method supported by the posterior probability of target recognition The incentive contract of subject librarians in university library under the non-linear task importance Modelling and Simulation of Collaborative Innovation System in Colleges and Universities Based on Interpreted Structural Equation Model Small amplitude periodic solution of Hopf Bifurcation Theorem for fractional differential equations of balance point in group competitive martial arts The Optimal Solution of Feature Decomposition Based on the Mathematical Model of Nonlinear Landscape Garden Features Composite mechanical performance of prefabricated concrete based on hysteresis curve equation Higher education innovation and reform model based on hierarchical probit Application of Fuzzy Mathematics Calculation in Quantitative Evaluation of Students’ Performance of Basketball Jump Shot The teaching of sports science of track and field-based on nonlinear mathematical equations Visual error correction of continuous aerobics action images based on graph difference function Ecological balance model of effective utilization of agricultural water resources based on fractional differential equations Application of Higher Order Ordinary Differential Equation Model in Financial Investment Stock Price Forecast Precision algorithms in second-order fractional differential equations Application of Forced Modulation Function Mathematical Model in the Characteristic Research of Reflective Intensity Fibre Sensors Fractional differential equations in National Sports Training in Colleges and Universities Radioactive source search problem and optimisation model based on meta-heuristic algorithm Application of mathematical probabilistic statistical model of base – FFCA financial data processing Research on a method of completeness index based on complex model Distribution network monitoring and management system based on intelligent recognition and judgement Fake online review recognition algorithm and optimisation research based on deep learning Research on the sustainable development and renewal of Macao inner harbour under the background of digitisation Support design of main retracement passage in fully mechanised coal mining face based on numerical simulation Study on the crushing mechanism and parameters of the two-flow crusher Topological optimisation technology of gravity dam section structure based on ANSYS partial differential equation operation Interaction design of financial insurance products under the Era of AIoT Modeling the pathway of breast cancer in the Middle East Corporate social responsibility fulfilment, product-market competition and debt risk: Evidence from China ARMA analysis of the green innovation technology of core enterprises under the ecosystem – Time series data Reconstruction of multimodal aesthetic critical discourse analysis framework Image design and interaction technology based on Fourier inverse transform What does students’ experience of e-portfolios suggest Research on China interregional industrial transformation slowdown and influencing factors of industrial transformation based on numerical simulation The medical health venture capital network community structure, information dissemination and the cognitive proximity The optimal model of employment and entrepreneurship models in colleges and universities based on probability theory and statistics A generative design method of building layout generated by path Analysis of the causes of the influence of the industrial economy on the social economy based on multiple linear regression equation Research of neural network for weld penetration control Engineering project management based on multiple regression equation and building information modelling technology Research on predictive control of students’ performance in PE classes based on the mathematical model of multiple linear regression equation Beam control method for multi-array antennas based on improved genetic algorithm The influence of X fuzzy mathematical method on basketball tactics scoring Mathematical model of back propagation for stock price forecasting Application of regression function model based on panel data in bank resource allocation financial risk management Application of Logical Regression Function Model in Credit Business of Commercial Banks Research on aerobics training posture motion capture based on mathematical similarity matching statistical analysis Application of Sobolev-Volterra projection and finite element numerical analysis of integral differential equations in modern art design Research on motion capture of dance training pose based on statistical analysis of mathematical similarity matching Application of data mining in basketball statistics Application of B-theory for numerical method of functional differential equations in the analysis of fair value in financial accounting Research on the influence of fuzzy mathematics simulation model in the development of Wushu market Study on audio-visual family restoration of children with mental disorders based on the mathematical model of fuzzy comprehensive evaluation of differential equation Difference-in-differences test for micro effect of technological finance cooperation pilot in China Application of multi-attribute decision-making methods based on normal random variables in supply chain risk management Exploration on the collaborative relationship between government, industry, and university from the perspective of collaborative innovation The impact of financial repression on manufacturing upgrade based on fractional Fourier transform and probability AtanK-A New SVM Kernel for Classification Validity and reliability analysis of the Chinese version of planned happenstance career inventory based on mathematical statistics Visual positioning system for marine industrial robot assembly based on complex variable function Mechanical behaviour of continuous girder bridge with corrugated steel webs constructed by RW Application of Lane-Emden differential equation numerical method in fair value analysis of financial accounting Regression function model in risk management of bank resource allocation Application of numerical method of functional differential equations in fair value of financial accounting Optimal solution of fractional differential equations in solving the relief of college students’ mental obstacles Risk contagion in financial markets based on copula model Calculating university education model based on finite element fractional differential equations and macro-control analysis Educational research on mathematics differential equation to simulate the model of children's mental health prevention and control system Analysis of enterprise management technology and innovation based on multilinear regression model Verifying the validity of the whole person model of mental health education activities in colleges based on differential equation RETRACTION NOTE Calculation of tourism development income index based on finite element ordinary differential mathematical equation Adoption of deep learning Markov model combined with copula function in portfolio risk measurement Radar system simulation and non-Gaussian mathematical model under virtual reality technology Comparison of compression estimations under the penalty functions of different violent crimes on campus through deep learning and linear spatial autoregressive models Research and application of constructing football training linear programming based on multiple linear regression equation Research on management evaluation of enterprise sales cash flow percentage method based on the application of quadratic linear regression equations Mathematical simulation analysis of optimal detection of shot-putters’ best path Determination of the minimum distance between vibration source and fibre under existing optical vibration signals: a study Mathematical modelling of enterprise financial risk assessment based on risk conduction model Nonlinear differential equations based on the B-S-M model in the pricing of derivatives in financial markets Mathematical simulation experiment based on optimisation of heat treatment process of aluminium alloy materials Mathematical model of transforming image elements to structured data based on BP neural network Educational reform informatisation based on fractional differential equation MCM of Student’s Physical Health Based on Mathematical Cone Sports health quantification method and system implementation based on multiple thermal physiology simulation Research on visual optimization design of machine–machine interface for mechanical industrial equipment based on nonlinear partial equations Informationisation of teaching model for track and field education based on finite element higher-order fractional differential equation Research on identifying psychological health problems of college students by logistic regression model based on data mining Information technology of preschool education reform of fine arts based on fractional differential equation Information Teaching Model of Preschool Art Education in Colleges and Universities Based on Finite Element Higher-Order Fractional Differential Equation A mathematical model of PCNN for image fusion with non-sampled contourlet transform Application of artificial intelligence algorithm in mathematical modelling and solving College Students’ Mental Health Climbing Consumption Model Based on Nonlinear Differential Equations Differential equation to verify the validity of the model of the whole-person mental health education activity in Universities Optimisation of Modelling of Finite Element Differential Equations with Modern Art Design Theory Analysis and Prediction of College Students’ Mental Health Based on K-means Clustering Algorithm Mathematical function data model analysis and synthesis system based on short-term human movement Human gait modelling and tracking based on motion functionalisation Analysis and synthesis of function data of human movement Energy-saving technology of BIM green buildings using fractional differential equation Study on the training model of football movement trajectory drop point based on fractional differential equation Financial Accounting Measurement Model Based on Numerical Analysis of Rigid Normal Differential Equation and Rigid Functional Equation User online consumption behaviour based on fractional differential equation Differential equation model of financial market stability based on Internet big data Multi-attribute Decision Method Based on Normal Random Variable in Economic Management Risk Control Children’s cognitive function and mental health based on finite element nonlinear mathematical model Dichotomy model based on the finite element differential equation in the educational informatisation teaching reform model Nonlinear Dissipative System Mathematical Equations in the Multi-regression Model of Information-based Teaching Stock price analysis based on the research of multiple linear regression macroeconomic variables Fractional Linear Regression Equation in Agricultural Disaster Assessment Model Based on Geographic Information System Analysis Technology