Sentence examples for data mining variables from inspiring English sources

Suggestions(1)

data collection variables

Exact(6)

The data mining variables differed for each condition.

In the data mining, variables were selected based on their scientific relevance to the targeted biomarkers of potential harm.

BMC Medical Research Methodology

For identifying LDL cholesterol ≥70 the PPV was 86% (95% CI: 83%, 89%) in the model using pre-specified variables, and increased to 91%9595% CI: 85%, 91%) when adding data mining variables (Table 2 and Additional file 1: Figure S2, Panel B).

BMC Health Services Research

In the model that included pre-specified variables, a predicted probability threshold of 0.55 yielded a PPV of 87%95%5% CI: 85%, 88%) for identifying high risk for CHD, and a sensitivity of 69%95%5% CI: 67%, 70%); results were similar after adding data mining variables (Table 2 and see Additional file 1: Figure S1, Panel A).

BMC Health Services Research

In the model using pre-specified variables, a predicted probability threshold of 0.28 yielded a PPV of 52%95%5% CI: 49%, 54%) for identifying very high risk for CHD events and a sensitivity of 63%95%5% CI: 59%, 66%); results were similar after adding data mining variables (Table 2 and see Additional file 1: Figure S1, Panel B).

BMC Health Services Research

In the model using pre-specified variables, a predicted probability threshold of 0.20 yielded a PPV of 31%95%5% CI: 27%, 36%) for identifying Framingham CHD risk score >20% and a sensitivity of 47%95%5% CI: 43%, 54%); results were similar after adding data mining variables (Table 2 and see Additional file 1: Figure S1, Panel C).

BMC Health Services Research

Similar(54)

Fourth, for all five conditions, we used claims data for only one year prior to the REGARDS in-home visit to define pre-specified and data mining Medicare variables, instead of using all available claims.

BMC Health Services Research

Given the complexity of water-associated infectious disease, statistical data mining and variable selection techniques using tree-based searches through the model space (Breiman 2001) may be useful.

Environmental Health Perspectives

Table 5 Results of the robustness test for data mining bias Proxy variable Number of different variable operationalizations in the primary studiesa Hyp.

Business Research

As regards data mining, patient-related variables like diagnosis, sex and birth date can be combined with data information in order to compose specific queries.

BMC Medical Informatics and Decision Making

In several data mining pipelines, important variables were selected from an RFM, which were subsequently used in other analysis techniques [ 50, 71].

Briefings in Bioinformatics

Ludwig, your English writing platform

Write better and faster with AI suggestions while staying true to your unique style.

Used by millions of students, scientific researchers, professional translators and editors from all over the world!

Since I tried Ludwig back in 2017, I have been constantly using it in both editing and translation. Ever since, I suggest it to my translators at ProSciEditing.

Justyna Jupowicz-Kozak

CEO of Professional Science Editing for Scientists @ prosciediting.com

Get started for free

Unlock your writing potential with Ludwig

Most frequent sentences:

1-200 1k 2k 3k 4k 5k 7k 10k 20k 40k 100k 200k 500k 0m-3 0m-4 1m-1 1m-2 1m-3 1m-4