Concepts
SenseML
SenseML is a query language for extracting structured data from documents such as PDFs. A field is the basic SenseML query unit for extracting a piece of document data. Each field outputs a JSON key-value pair that structures the data it extracts.
Here’s a simple example of a field:
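A minimal sketch of how such a field could look, pieced together from the concepts described on this page. The top-level `fields` array and the lowercase `label` method ID are assumptions about the usual SenseML layout. The anchor text is truncated on this page, so the string keeps the ellipsis as-is and stands in for the full text to match.

```json
{
  "fields": [
    {
      "id": "name_of_output_key",
      "anchor": "an anchor is some text to match…",
      "method": {
        "id": "label",
        "position": "below"
      }
    }
  ]
}
```

Read top to bottom, the field names its output key, states the text to anchor on, and then says how to move from that anchor to the data.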
The following image shows this example in the Sensible app:
As the preceding image shows, here’s the output of the example field:
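A rough sketch of the shape this output might take: a single key-value pair whose key is the field ID. The `type` and `value` wrapper is an assumption about how the extracted data is represented, and the value is a placeholder, because the document text that the field matches is not reproduced on this page.

```json
{
  "name_of_output_key": {
    "type": "string",
    "value": "<text extracted from below the anchor>"
  }
}
```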
This example shows the following key concepts:
key | description |
---|---|
field | A query that extracts data in relation to matched text. Its ID is the key for the extracted data. In this example, the ID is name_of_output_key. |
anchor | Matched text that helps narrow down the location in the document from which to extract data. In this example, “an anchor is some text to match…”. |
method | Defines how to expand out from the anchor and extract data. In this example, the Label method extracts data that’s below the anchor ("position": "below"). For a list of methods, see Methods. |
For a more complete SenseML example, see the SenseML introduction.