[3/3] The Power Search: Cheat Sheet

- Previous Article: [2/3] The Power Search: Syntactic Sugar

Overview of Important Properties

Property Description
label The title of the Signal.
summary The summary of the Signal.
country The country the Signals was published in.
publishedDate The date the Signal was published.
type The type of Signal.
source The source the Signals was ingested from.
newsSiteDomainRank Alexa rank assigned to the source of a Signal. Only applies to Signals of type News.
topic

Topics we have enriched the Signals with based on their textual data. These are the topics you can also see in the list of Signals as tags. Use together with the CONTAINS operator.

topic CONTAINS "Computer security"
iabCategory

IAB categories for industries we have enriched the Signals with based on their textual data. Use in combination with the CONTAINS operator. For a list of categories and how to use them, see How can I use the power search to filter by IAB Categories?

iptcCategory

IPTC media categories we have enriched the Signals with. This is the categorization that is currently used for the Cluster Visualization

iptcCategory CONTAINS "Business Information"
organization

Companies and other organizations that are enriched using named entity recognition (NER). Use in combination with the CONTAINS operator.

organization CONTAINS "Microsoft"
person

People that are enriched using NER. Use in combination with the CONTAINS operator.

person CONTAINS "Bill Gates"
place Places that enriched using NER. Use in combination with the CONTAINS operator.
place CONTAINS "California"

journal

Searches through the title of journals that scientific publications have been published in.

journal CONTAINS "International Research Journal of Computer Science"

Overview of Basic Operators

To help out with the example below, let us consider the titles of the following two fictitious Signals:

A: "Wireless Energy Transfer"
B: "Energy Transfer without Wires"

Note that string matching is not case-sensitive and that many non-alpha-numeric characters such as dashes (-) are considered white space.

Exact String Match

The equals-operator (=) will return a Signal if the value is exactly matched.

1. label = "Wireless Energy Transfer"
2. label = "Wireless"
  • Search 1 will produce Signal A because the title and the value in the search are exactly the same.
  • Search 2 on the other hand will not succeed in producing Signal A.

Contains

The contains-operator (CONTAINS) searches for the exact words within the property.

1. label CONTAINS "Wire"
2. label CONTAINS "Wireless"
  • Search 1 will produce no results as Signals A and B include words that only begin with Wire but continue on.
  • Search 2 will produce Signal A as its title contains the exact word Wireless.

Fuzzy String Match

The tilde-operator (~) will perform a fuzzy string match which will usually produce a larger set of results.

1. label ~ "Wire"
2. label = "Wireless Charging"
  • Search 1 will produce both Signals A and B as they include words that include the search value within.
  • Search 2 will produce Signal A even though the word Chargindoesn't in fact occur in its title. The match of Wireless is sufficient for the fuzzy string match.

Comparison

Different comparison operators (<, <=, >, >=) can be used to compare values. Examples:

newsSiteDomainRank < 1000
publishedDate >= 1577880000000

Notes on Values

Note that for many properties such as the published date, country or the category, it is much easier to use the context inputs as these properties have many values that need to be entered precisely.

Country: For countries, we use the 2-letter ISO country codes.

Published date: We are currently using milliseconds since 01.01.1970. As this is not very practical, we will work on improving this.