A Dataset is a namespace or bucket for collecting or grouping concepts together.
The cards listed on your Data Graphs app home page each represent one Dataset. On this card you can see how many Concepts are contained in this Dataset and the types of Concepts it contains.
Data Sets provide two main functions:
- Information management - a way to efficiently or conveniently manage data
- Security - a way to make some of your data public or privately accessible to applications at integration time
A Dataset does not add any semantic value to your knowledge graph, but does provide a nice way for you to group data together to improve its manageability.
For example, you may wish to put all concepts of one type into its own Dataset, such that a Dataset of Organizations would all be collected together. Alternatively you may have defined a Person type and create one Dataset to hold all politicians and another Dataset to hold all sports people.
* Remember - you can still federate across all your Datasets - putting the same type of concepts into multiple data sets does not prevent you from searching and navigating data by type across your entire knowledge graph.
A Dataset can be declared as public or private. This defines how the data within the Dataset is accessed by the Data Graphs API.
Private - A Dataset declared as private can only be accessed using OpenID credentials. This means your data in this Data Set is secure and can only be accessed via an authenticated user request. Machine-2-Machine OpenID / oAuth credentials can be setup in the account management configuration settings.
Public - A Dataset declared as public can be accessed using just your API key. These data requests can be made without authentication.