SocialLinks database

Case study with Social Links
The SocialLinks database is a graph database into which we have already loaded about 7 billion records (6.4 billion at the end of 2017) about people, companies, places and their connections.
Most of the data is obtained by parsing a variety of white and yellow pages, company registers, business directories, social networks and other open online sources.

We ensure that fields holding the same kind of entity always have the same name: the phone field always contains only phone numbers, the alias field contains nicknames or usernames, the ip field contains IP addresses, and so on. This makes it possible to search the whole database at once and to supplement the results with related records.
We also normalize and clean up the basic fields.
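
To illustrate what consistent field naming and cleanup can look like, here is a minimal Python sketch. It is not the actual SL DB pipeline: the alias table, the function names and the normalization rules (for example, treating every phone number as already carrying its country code) are assumptions made for this example.

```python
import ipaddress
import re

# Hypothetical mapping from source-specific field names to canonical ones.
FIELD_ALIASES = {
    "phone": "phone", "tel": "phone", "mobile": "phone",
    "email": "email", "mail": "email", "e-mail": "email",
    "alias": "alias", "username": "alias", "nickname": "alias",
    "ip": "ip", "ip_address": "ip",
}


def normalize_phone(value):
    # Keep digits only; naively assumes the country code is already present.
    digits = re.sub(r"\D", "", value)
    return "+" + digits if digits else None


def normalize_email(value):
    # Lowercase and apply a deliberately loose shape check.
    value = value.strip().lower()
    return value if re.fullmatch(r"[^@\s]+@[^@\s]+\.[^@\s]+", value) else None


def normalize_alias(value):
    # Drop a leading "@" and lowercase the nickname.
    value = value.strip().lstrip("@").lower()
    return value or None


def normalize_ip(value):
    # Let the standard library validate and canonicalize the address.
    try:
        return str(ipaddress.ip_address(value.strip()))
    except ValueError:
        return None


NORMALIZERS = {
    "phone": normalize_phone,
    "email": normalize_email,
    "alias": normalize_alias,
    "ip": normalize_ip,
}


def normalize_record(raw):
    """Return a record with canonical field names and cleaned values."""
    clean = {}
    for key, value in raw.items():
        field = FIELD_ALIASES.get(key.strip().lower())
        if field is None or not isinstance(value, str):
            continue  # unknown field or non-text value: skipped in this sketch
        normalized = NORMALIZERS[field](value)
        if normalized is not None:
            clean[field] = normalized
    return clean


record = normalize_record({"Tel": "+1 (555) 010-0199", "E-Mail": " John@Example.COM "})
print(record)
```

Once every source is mapped onto the same small set of field names, a single query on phone or email can cover records from all sources at once.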

When building relationships, we use both natural links through clear identifiers (email, phone, social network ID, IP) and links taken from the original data source, for example when one record contains information about both a person and a company, or a list of a company's employees.
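
The two kinds of links can be sketched in a few lines of Python. This is only an illustration under assumptions: the record layout, the field names in ID_FIELDS and the build_links function are hypothetical, and a real graph database would store edges rather than compute them in memory.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical identifier fields that create "natural" links between records.
ID_FIELDS = ("email", "phone", "social_id", "ip")


def build_links(records, source_links=()):
    """Return a set of (record_a, record_b, label) edges.

    records: dict mapping a record id to a dict of fields.
    source_links: explicit (a, b, label) edges taken from the data source,
    e.g. a person listed as an employee of a company in the same record.
    """
    # Index records by each identifier value they contain.
    index = defaultdict(list)
    for rec_id, rec in records.items():
        for field in ID_FIELDS:
            value = rec.get(field)
            if value:
                index[(field, value)].append(rec_id)

    edges = set()
    # Natural links: any two records sharing the same identifier value.
    for (field, _value), ids in index.items():
        for a, b in combinations(sorted(ids), 2):
            edges.add((a, b, field))

    # Source links are kept exactly as the source states them.
    for a, b, label in source_links:
        edges.add((a, b, label))
    return edges


records = {
    "person:1": {"email": "a@example.com", "ip": "10.0.0.1"},
    "person:2": {"email": "a@example.com"},
    "company:1": {"ip": "10.0.0.1"},
}
links = build_links(records, [("person:2", "company:1", "employee_of")])
for edge in sorted(links):
    print(edge)
```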

SL DB contains data from all over the world and serves as a great addition to our online sources.

Based on data from SL DB, we have built several separate transforms:
  • [SL DB] IP to Emails and [SL DB] Email to IPs
  • [SL DB] IP to Phones and [SL DB] Phone to IPs
  • [SL DB] To Emails @domain
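
Conceptually, each of the items above is a lookup from one field to another over the same records. The Python sketch below shows the idea on an in-memory list. The pivot and emails_at_domain helpers and the record layout are hypothetical, and the sketch does not reflect how SL DB actually executes these lookups.

```python
def pivot(records, from_field, value, to_field):
    """Return the distinct to_field values of records whose from_field equals value."""
    found = {
        rec[to_field]
        for rec in records
        if rec.get(from_field) == value and rec.get(to_field)
    }
    return sorted(found)


def emails_at_domain(records, domain):
    """Return the distinct email addresses that belong to the given domain."""
    suffix = "@" + domain.lower()
    found = {
        rec["email"]
        for rec in records
        if (rec.get("email") or "").lower().endswith(suffix)
    }
    return sorted(found)


# Made-up sample records using documentation-only addresses.
records = [
    {"email": "alice@example.com", "ip": "203.0.113.5", "phone": "+15550100"},
    {"email": "bob@example.com", "ip": "203.0.113.5"},
    {"email": "carol@example.org", "ip": "198.51.100.7", "phone": "+15550199"},
]

print(pivot(records, "ip", "203.0.113.5", "email"))        # IP to Emails
print(pivot(records, "email", "carol@example.org", "ip"))  # Email to IPs
print(pivot(records, "phone", "+15550100", "ip"))          # Phone to IPs
print(emails_at_domain(records, "example.com"))            # To Emails @domain
```

Because the field names are the same across all sources, one generic lookup covers both directions of each pair (IP to Emails and Email to IPs, IP to Phones and Phone to IPs).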