Used and loved by millions
Since I tried Ludwig back in 2017, I have been constantly using it in both editing and translation. Ever since, I suggest it to my translators at ProSciEditing.

Justyna Jupowicz-Kozak
CEO of Professional Science Editing for Scientists @ prosciediting.com
dataset comprising
Grammar usage guide and real-world examplesUSAGE SUMMARY
The phrase "dataset comprising" is correct and usable in written English.
It can be used when describing the contents or components of a dataset in a formal or academic context. Example: "The study utilized a dataset comprising various demographic factors and health outcomes."
✓ Grammatically correct
Science
Alternative expressions(3)
Table of contents
Usage summary
Human-verified examples
Expert writing tips
Linguistic context
Ludwig's wrap-up
Alternative expressions
FAQs
Human-verified examples from authoritative sources
Exact Expressions
59 human-written examples
A dataset comprising 799 ground plots was used.
It is validated on a large dataset comprising clinical acquired DW images from 741 subjects.
Science
Additional file 1: Expanded LiverTox dataset comprising 178 hepatotoxic, 185 nonhepatotoxic, and 242 possible hepatotoxic compounds.
Science
A dataset comprising 36 purine nucleoside analogs was selected for the present investigation (Fig. 1 and Table 1).
Science
This paper presents air permeability results from the largest UK dataset, comprising 144,024 dwellings tested under the Air Tightness Testing and Measurement Association ATTMAA) scheme.
Science
A dataset comprising 169 diverse retinal images was tested, and the segmentation results were assessed by a gold standard derived from the annotations of five domain experts.
The large substation load dataset, comprising load time series with annual interval, is firstly clustered by the proposed linear clustering method.
Quantitative PCM models were then trained on a dataset comprising 20 eukaryotic, protozoan and bacterial DHFR sequences, and 1,505 distinct compounds (in total 3,099 data points).
Science
INRIA Holidays [19] is a dataset comprising 1491 high-resolution personal holiday photos of different locations and objects, 500 of which are used as queries.
On an in-house dataset comprising 700 iris images of 70 subjects, a FNMR of 0.47 % at a zero FMR is reported.
Human-verified similar examples from authoritative sources
Similar Expressions
1 human-written examples
This re-organization resulted in a compendium (meta-dataset) comprising 989 unique adenocarcinoma samples from seven independent cohorts.
Science
Expert writing Tips
Best practice
When describing a dataset, clearly specify what the "dataset comprising" includes to provide context and ensure understandability.
Common error
Avoid using "dataset comprising" without specifying the key elements or variables included. Be specific to provide clarity and avoid ambiguity.
Source & Trust
82%
Authority and reliability
4.5/5
Expert rating
Real-world application tested
Linguistic Context
The phrase "dataset comprising" functions as a descriptive element, specifying the contents or components of a dataset. Ludwig's examples show its use in various scientific and academic contexts.
Frequent in
Science
100%
Less common in
News & Media
0%
Formal & Business
0%
Ludwig's WRAP-UP
The phrase "dataset comprising" is a grammatically correct and widely used expression, particularly within scientific and academic writing. Ludwig AI confirms its acceptability and provides numerous examples from reputable sources. It serves to clearly define the elements that constitute a specific dataset, aiding in clarity and precision. While alternatives such as "dataset consisting of" or "dataset including" exist, "dataset comprising" remains a strong and effective choice. Remember to provide specific details about the elements included in the "dataset comprising" to ensure clarity and avoid vagueness in your writing.
More alternative expressions(6)
Phrases that express similar concepts, ordered by semantic similarity:
dataset consisting of
This alternative emphasizes the components that make up the dataset.
dataset including
This alternative suggests that the listed items are part of the dataset, but the dataset might contain more.
dataset encompassing
This alternative implies that the dataset comprehensively covers the listed elements.
dataset featuring
This alternative highlights prominent or notable elements within the dataset.
dataset incorporating
This alternative indicates that the dataset combines or integrates the specified components.
dataset constituted by
This is a more formal and slightly less common way of saying "dataset comprising".
dataset made up of
This alternative is more informal but conveys the same meaning.
dataset with
This alternative is a more concise way to express the composition of the dataset.
dataset containing
This alternative specifies the dataset holds the particular components inside of it.
dataset composed of
This alternative describes the fundamental parts the dataset is created from.
FAQs
What does "dataset comprising" mean?
The phrase "dataset comprising" means that the dataset is made up of or includes the items, elements, or information that follow. It indicates the composition of the dataset.
How can I use "dataset comprising" in a sentence?
You can use it to describe what a dataset contains. For example, "The study used a "dataset comprising" demographic data and health records."
What are some alternatives to "dataset comprising"?
Alternatives include "dataset consisting of", "dataset including", or "dataset containing", depending on the context.
Is there a difference between "dataset comprising" and "dataset consisting of"?
While similar, "dataset comprising" may imply that the listed items are the most important or relevant, while "dataset consisting of" suggests a complete listing of all elements.
Editing plus AI, all in one place.
Stop switching between tools. Your AI writing partner for everything—polishing proposals, crafting emails, finding the right tone.
Table of contents
Usage summary
Human-verified examples
Expert writing tips
Linguistic context
Ludwig's wrap-up
Alternative expressions
FAQs
Source & Trust
82%
Authority and reliability
4.5/5
Expert rating
Real-world application tested