The boxplots were used to demonstrate all the independent 
variables X visually and verify the positive skewed trend. 
Table 2: Confusion matrix of the model. 
  Reference 
predict  0  1 
0  105  17 
1  10  45 
After  fitting  the  best  lambda,  we  create  a 
confusion  matrix  to  evaluate  the  accuracy  of  our 
modeling.  Our  data  are  divided  into  two  parts  in 
which the training part contains 70 percent of the data 
and the test part contains 30 percent. The reference 
means the true value and the prediction represents the 
value that the model predicted. 
4  DISCUSSION 
Pancreatic cancer is a highly malignant tumor of the 
digestive system, and the molecular mechanism of its 
occurrence and progression is still uncertain. In this 
article,  we  are  interested  in  the  early  detection, 
prediction  and  diagnosis  of  pancreatic  cancer.  We 
have analyzed and discussed again based on the data 
of  previous  researchers,  trying  to  explore  which 
factors are related to pancreatic cancer, but it still has 
certain limitation. 
We detect five  urinary biomarkers in this  study. 
Lymphatic vessel endothelial hyaluronan receptor 1 
(LYVE1) is a receptor that binds to both soluble and 
immobilised hyaluronan. LYVE1 plays an important 
role  in  lymphatic  hyaluronan  transport  and  tumor 
metastasis.  Regenerating  family  member  1  beta 
(REG1B) belongs  to  a  family  of  glycoproteins and 
may  promote  regeneration  of  pancreatic  islets. 
Regenerating family member 1 alpha (REG1A) is a 
protein which is highly similar to REG1B (Frappart 
and Hofmann, 2020). Trefoil factor 1 (TFF1) is a 6.5 
kDa  secreted  protein  that  belongs  to  a  family  of 
gastrointestinal  secretory  peptides.  It  is  expressed 
predominantly in normal gastric mucosa and involved 
in the regeneration and repair of urinary tract. TFF1 
plays an important role in the development of cancer. 
Creatinine is a protein which is a product of muscle 
metabolism and is primarily cleared by the kidneys.   
There are still many factors that are not included 
in the database that can still affect the incidence and 
prediction of PDAC to a large extent. Firstly, HER2 
may  play  an  important  role  in  the  occurrence  and 
development of pancreatic ductal adenocarcinoma in 
elderly  patients.  The  overexpression  rate  of  HER2 
may be  related to gender,  but  its mechanism  needs 
further study (Ballehaninna and Chamberlain, 2012). 
Secondly, we still have a lot to learn from in research 
methods.  In  known  studies,  including  drawing 
survival curves based on the Kaplan-Meier method, 
comparing survival time differences using Log-rank 
test, multivariate Cox regression analysis to assess the 
risk  factors  affecting  patient  survival,  etc.,  can  be 
used  to  obtain  better  results.  good  result.  In  future 
research, we will continue to work hard to bring better 
research and results. 
5  CONCLUSION 
In our work, it can be concluded that age, LYVE1, 
REG1A,  REG1B,  and  the  interaction  between 
creatinine and REG1A are the key predictors for the 
diagnosis  of  pancreatic  cancer.  Their  performances 
are  successfully  validated  by  confusion  matrix. 
Furthermore,  we  plan  to  search  for  more  clinical 
datasets to  verify our  model and  apply our  logistic 
regression  approach  to  more  available  datasets  of 
cardiovascular diseases and other types of cancer. 
REFERENCES 
Adamska,  A.,  Domenichini,  A.,  &  Falasca,  M.  (2017). 
Pancreatic  Ductal  Adenocarcinoma:  Current  and 
Evolving Therapies. International journal of molecular 
sciences, 18(7),  1338.   
https://doi.org/10.3390/ijms18071338 
Arnold,  M.,  Rutherford,  M.  J.,  Bardot,  A.,  Ferlay,  J., 
Andersson,  T.  M.,  Myklebust,  T.  Å.,  Tervonen,  H., 
Thursfield, V., Ransom, D., Shack, L., Woods, R. R., 
Turner, D., Leonfellner, S., Ryan, S., Saint-Jacques, N., 
De,  P.,  McClure,  C.,  Ramanakumar,  A.  V.,  Stuart-
Panko, H., Engholm, G., … Bray, F. (2019). Progress 
in  cancer  survival,  mortality,  and  incidence  in  seven 
high-income  countries  1995-2014  (ICBP 
SURVMARK-2):  a  population-based  study. The 
Lancet. Oncology, 20(11),  1493–
1505.https://doi.org/10.1016/S1470-2045(19)30456-5 
Ballehaninna,  U.  K.,  &  Chamberlain,  R.  S.  (2012).  The 
clinical  utility  of  serum  CA  19-9  in  the  diagnosis, 
prognosis  and  management  of  pancreatic 
adenocarcinoma: An evidence based appraisal. Journal 
of gastrointestinal oncology, 3(2),  105–119. 
https://doi.org/10.3978/j.issn.2078-6891.2011.021 
Bray, F., Ferlay, J., Soerjomataram, I., Siegel, R. L., Torre, 
L.  A.,  &  Jemal,  A.  (2018).  Global  cancer  statistics 
2018:  GLOBOCAN  estimates  of  incidence  and 
mortality  worldwide  for  36  cancers  in  185 
countries. CA:  a cancer journal for clinicians, 68(6), 
394–424. https://doi.org/10.3322/caac.21492