Patent attributes
Guided exploration of data is provided. A semantic graph corresponding to a dataset is generated using identified relations among columns of the dataset that are identified based on mapping the columns to main concepts in a generic ontology. A subgraph of the semantic graph is formed based on identification of nodes corresponding to relevant central concepts within the semantic graph using graph centrality metrics. A plurality of paths is identified in the subgraph using a bi-directional multiple hop search from the nodes corresponding to the relevant central concepts in the subgraph. A relevance score is assigned to each path in the plurality of paths using a graph-theoretic metric and the graph centrality metrics. A set of natural language questions based on relevant central concepts and concept relations corresponding to nodes in each respective path with an assigned relevance score greater than a minimum threshold score is output to the user.