Congratulations!

[Valid Atom 1.0] This is a valid Atom 1.0 feed.

Recommendations

This feed is valid, but interoperability with the widest range of feed readers could be improved by implementing the following recommendations.

Source: https://urlscan.io/blog/feed.xml

  1. <?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="4.2.2">Jekyll</generator><link href="https://urlscan.io/blog/feed.xml" rel="self" type="application/atom+xml" /><link href="https://urlscan.io/blog/" rel="alternate" type="text/html" /><updated>2024-01-16T15:03:10+01:00</updated><id>https://urlscan.io/blog/feed.xml</id><title type="html">Blog - urlscan.io</title><subtitle>urlscan.io Blog - Announcements, Product News, Tutorials, Service Incidents</subtitle><author><name>urlscan.io</name></author><entry><title type="html">urlscan Pro — Inline Matching, System-Labels, User-Tags</title><link href="https://urlscan.io/blog/2024/01/10/inline-matching-system-labels-user-tags/" rel="alternate" type="text/html" title="urlscan Pro — Inline Matching, System-Labels, User-Tags" /><published>2024-01-10T00:00:00+01:00</published><updated>2024-01-10T00:00:00+01:00</updated><id>https://urlscan.io/blog/2024/01/10/inline-matching-system-labels-user-tags</id><content type="html" xml:base="https://urlscan.io/blog/2024/01/10/inline-matching-system-labels-user-tags/"><![CDATA[<p>As we welcome the year 2024, we wanted to update you on what we have been
  2. working on in the second half of 2023 and announce the new features that are
  3. launching today. These changes will have a profound impact for our customer
  4. workflows and our own detection and classification abilities.</p>
  5.  
  6. <h3 id="saved-searches--a-success-story">Saved Searches — A success story</h3>
  7.  
  8. <p>When we launched <em>Saved Searches</em> in 2022 for our <code class="language-plaintext highlighter-rouge">scans</code> and <code class="language-plaintext highlighter-rouge">hostnames</code>
  9. feeds, we did not envision how popular this feature would turn out to be.
  10. Initially, Saved Searches were meant as a convenient way to bookmark a search
  11. term within the urlscan Pro platform. The <em>Subscriptions</em> feature allowed
  12. customers to receive notifications for any new items that matched their Saved
  13. Searches.</p>
  14.  
  15. <p>Over the past year, the value of Saved Searches to customers has become
  16. abundantly clear. Right now we manage more than 3000 Saved Searches and almost
  17. 1000 Subscriptions that have been created by our customers. Our subscription
  18. notification system sends out over 5000 emails a day.</p>
  19.  
  20. <p>Saved Searches and Subscriptions became even more important when we launched
  21. our <a href="/blog/2022/11/22/newly-observed-domains-and-hostnames/">Newly Observed Domains &amp; Hostnames
  22. Feed</a> in late 2022 and
  23. <a href="/blog/2023/07/18/launching-urlscan-observe/">urlscan Observe</a> earlier this
  24. year. Since then, many of our customers have set up Saved Searches to look for
  25. domains impersonating their brand or targeting their workforce. Our feed
  26. captures 2.5 million new domains and hostnames every day, so having an
  27. expressive search ability to find and alert on interesting hits is crucial.</p>
  28.  
  29. <p>Today we are launching <em>major improvements</em> for Saved Searches, Subscriptions and
  30. collaboration within the urlscan Pro platform.</p>
  31.  
  32. <!--more-->
  33.  
  34. <h3 id="saved-searches-before">Saved Searches Before</h3>
  35.  
  36. <p>Until now, Saved Searches and Subscriptions were basically just stored entries
  37. in a database. In order to alert on a Saved Search, we would run its full query
  38. term against the Search API every time we wanted to compile a list of hits.
  39. While this approach would yield the correct results, it was not great for
  40. multiple reasons:</p>
  41.  
  42. <ul>
  43.  <li>As the number of Saved Searches grew, we had to run thousands of queries,
  44. sometimes as often as every few minutes.</li>
  45.  <li>Our own Visual Detects and Deletion rules used the same timed method for
  46. querying, resulting in a delay of 1-2 minutes before a Visual Detect was
  47. applied or a scan was deleted.</li>
  48.  <li>Customers could not easily determine if a specific result returned by Search
  49. API did match any of their Saved Searches.</li>
  50. </ul>
  51.  
  52. <p>We knew that we had to improve the implementation of Saved Searches
  53. and Subscriptions if we wanted to continue scaling our platform and user base
  54. and offer more advanced features.</p>
  55.  
  56. <p><img src="/blog/assets/images/saved-searches-old.png" alt="Saved Searches - Old Implementation" />
  57. <em>Our old saved search workflow</em></p>
  58.  
  59. <h3 id="saved-searches-now--inline-matching">Saved Searches Now — Inline matching</h3>
  60.  
  61. <p>As part of this release, we have changed Saved Searches so that they are now
  62. executed <strong>inline</strong> against new scans, hostnames and domains entering our
  63. feeds. Elasticsearch calls this process <em>“percolation”</em>. Whenever urlscan
  64. performs a new scan or finds a new domain, it is executed against
  65. various types of stored rules in two passes:</p>
  66. <ul>
  67.  <li>During the first pass, urlscan applies its
  68. own block-and-delete rules, its brand-and-phishing detections, and its new
  69. System Labels (see next paragraph).</li>
  70.  <li>During the second pass, we
  71. execute the thousands of Saved Searches by our customers. Matching searches are
  72. recorded in a special <code class="language-plaintext highlighter-rouge">meta</code> field in our Elasticsearch index. Customers can
  73. then query this <code class="language-plaintext highlighter-rouge">meta</code> field with the ID of the search (or subscription) they
  74. want to retrieve new results for.</li>
  75. </ul>
  76.  
  77. <p><img src="/blog/assets/images/saved-searches-new.png" alt="Saved Searches - Inline Matching" />
  78. <em>New inline matching pipeline</em></p>
  79.  
  80. <p>As part of a Saved Search, customers can also apply their own <em>custom tags</em> to
  81. matching items. These can be arbitrary tags, and customers can control the
  82. visibility of these tags within the urlscan Pro platform. User-supplied tags
  83. will appear in the <code class="language-plaintext highlighter-rouge">usertags</code> fields in the Search API and Result API.</p>
  84.  
  85. <h4 id="new-customer-capabilities">New customer capabilities</h4>
  86.  
  87. <p>Executing and matching rules inline has a couple of interesting implications:</p>
  88.  
  89. <ul>
  90.  <li>Customers can query for all items that match multiple specific searches,
  91. or that match one search but don’t match another one.</li>
  92.  <li>Customers can query for all items that match any search within a specific
  93. subscription with a single query term, or any of their searches period.</li>
  94.  <li>When looking at search results, customers can determine whether any
  95. particular result matched any of their Saved Searches.</li>
  96.  <li>Customers can use labels from the first matching pass in their
  97. Saved Searches, e.g. to filter by system labels or brand detections.</li>
  98.  <li>Customers can share results of their Saved Searches with other users on the
  99. urlscan Pro platform without exposing their actual search terms.</li>
  100.  <li>Complex queries could easily take multiple seconds to execute. With inline
  101. matching, the expensive matching step is done during ingestion and the
  102. customer can run a very efficient keyword query to retrieve all results.</li>
  103.  <li>Visual Searches can also be run inline, making them much more efficient and
  104. instant. Inline Visual Searches will be more accurate than using the Visual
  105. Search API which relies on <em>approximate</em> nearest-neighbour searches.</li>
  106. </ul>
  107.  
  108. <h4 id="system-labels">System Labels</h4>
  109.  
  110. <p><strong>System Labels</strong> are classifications applied to scans and hostnames by dynamic
  111. rules managed by urlscan. Labels will be returned in the <code class="language-plaintext highlighter-rouge">labels</code> field which
  112. is a different namespace than user-defined tags. The idea for labels is that
  113. these are stable and exhaustively documented, whereas user tags can be
  114. arbitrary, short-lived and in some cases imprecise.</p>
  115.  
  116. <p>Going forward, we will curate labels covering common classification objectives that
  117. our customers can use to include or exclude from their search results. Here
  118. are some examples of labels for scans we plan on introducing:</p>
  119.  
  120. <ul>
  121.  <li><code class="language-plaintext highlighter-rouge">content.mature</code> - Page likely contains mature content.</li>
  122.  <li><code class="language-plaintext highlighter-rouge">content.opendir</code> - Page is showing an open directory.</li>
  123.  <li><code class="language-plaintext highlighter-rouge">site.parkeddomain</code> - Domain is currently parked / for sale.</li>
  124.  <li><code class="language-plaintext highlighter-rouge">site.takedown</code> - Site is showing a takedown / disabled-account notice.</li>
  125.  <li><code class="language-plaintext highlighter-rouge">tech.captcha</code> - Site employed a captcha prompt.</li>
  126.  <li><code class="language-plaintext highlighter-rouge">tech.captcha.waiting</code> - Site is showing an unsolved Captcha prompt.</li>
  127. </ul>
  128.  
  129. <p>These are just some initial ideas and we know our customers will come up with
  130. all kinds of creative ways they want to consume our data.</p>
  131.  
  132. <h4 id="additional-benefits">Additional Benefits</h4>
  133.  
  134. <p>With Inline Matching in place, we were able to improve various aspects of our
  135. platform. For customers, the most visible changes include:</p>
  136.  
  137. <ul>
  138.  <li><em>Saved Searches</em> are sanity-checked before we allow them to be saved. Trying
  139. to search fields which don’t exist will now throw an error.</li>
  140.  <li><em>Account-protection for scans</em> — Upon request, we can restrict access
  141. to scan results to the owner of that account. (Feature available in
  142. Enterprise and Ultimate).</li>
  143.  <li><em>Improved Brand Detection</em> — We will use the expressiveness of our
  144. inline matching to craft more encompassing detection rules for our brand- and
  145. phishing-feed.</li>
  146.  <li><em>Instant Visual Detections</em> — Visual Detects for phishing pages are now
  147. performed instantly. Previously these would have a small 1-2 minute delay.</li>
  148.  <li><em>Instant Deletion</em> — Once we add a block-and-delete rule, it is
  149. effective immediately for scan results.</li>
  150.  <li><em>Unified Blocking Logic</em> — URLs that are blocked from scanning are
  151. maintained by us in a unified way along with delete rules. These blocking
  152. rules also contain more context about why a certain URL or hostname might be
  153. blocked.</li>
  154. </ul>
  155.  
  156. <h4 id="next-steps">Next steps</h4>
  157.  
  158. <p>We are launching inline matching today, but for us this is just the first step
  159. of many. Over the coming weeks and months we will examine how our customers are
  160. adopting the new features and what they might be missing. We will also expose
  161. these new capabilities gradually via our urlscan Pro UI. Make sure to keep an
  162. eye on our changelog!</p>
  163.  
  164. <h3 id="api-changes">API Changes</h3>
  165.  
  166. <p>On top of the new features launching with this release, we also made small
  167. improvements to the platform in various places. This is a summary of changes
  168. to API behaviour within this release:</p>
  169.  
  170. <ul>
  171.  <li><strong>Result API</strong>: If a scan has finished but has since been deleted it will now
  172. return a <strong>HTTP/410</strong> error code. If you receive this code from the Result
  173. API, you can stop trying to retrieve the result.</li>
  174.  <li><strong>Result API</strong>: Introduction of the following new fields to achieve more uniformity with the Search API:
  175.    <ul>
  176.      <li><code class="language-plaintext highlighter-rouge">labels</code>: System Labels (see above) - Only in urlscan Pro</li>
  177.      <li><code class="language-plaintext highlighter-rouge">usertags</code>: User Tags (see above) - Only in urlscan Pro</li>
  178.      <li><code class="language-plaintext highlighter-rouge">metatags</code>: Meta hits for this item - Only in urlscan Pro (<strong>Attention</strong>: This field is called <code class="language-plaintext highlighter-rouge">meta</code> in the Search API)</li>
  179.      <li><code class="language-plaintext highlighter-rouge">page.apexDomain</code>: The registered second-level domain of the page hostname</li>
  180.      <li><code class="language-plaintext highlighter-rouge">page.mimeType</code>: Page MIME type</li>
  181.      <li><code class="language-plaintext highlighter-rouge">page.redirected</code>: Whether the page was redirected</li>
  182.      <li><code class="language-plaintext highlighter-rouge">page.status</code>: HTTP response code for primary page</li>
  183.      <li><code class="language-plaintext highlighter-rouge">page.title</code>: Title of the website</li>
  184.      <li><code class="language-plaintext highlighter-rouge">page.tlsAgeDays</code>: Age of the TLS certificate at the time of scanning</li>
  185.      <li><code class="language-plaintext highlighter-rouge">page.tlsIssuer</code>: TLS issuer name for the TLS cert of the page</li>
  186.      <li><code class="language-plaintext highlighter-rouge">page.tlsValidDays</code>: Validity period of the TLS certificate in days</li>
  187.      <li><code class="language-plaintext highlighter-rouge">page.tlsValidFrom</code>: ISO 8601 timestamp of valid-from date for page TLS certificate</li>
  188.      <li><code class="language-plaintext highlighter-rouge">page.umbrellaRank</code>: Cisco Umbrella rank of the page hostname</li>
  189.      <li><code class="language-plaintext highlighter-rouge">task.apexDomain</code>: The registered second-level domain of the task hostname</li>
  190.    </ul>
  191.  </li>
  192.  <li><strong>Search API</strong>: Introduction of the following new fields:
  193.    <ul>
  194.      <li><code class="language-plaintext highlighter-rouge">labels</code>: System Labels (see above) - Only in urlscan Pro</li>
  195.      <li><code class="language-plaintext highlighter-rouge">usertags</code>: User Tags (see above) - Only in urlscan Pro</li>
  196.      <li><code class="language-plaintext highlighter-rouge">meta</code>: Meta hits for this item - Only in urlscan Pro (<strong>Attention</strong>: This field is called <code class="language-plaintext highlighter-rouge">metatags</code> in the Result API)</li>
  197.    </ul>
  198.  </li>
  199.  <li><strong>Search API</strong>: It will now respond with <strong>HTTP/503</strong> instead of <strong>HTTP/400</strong>
  200. if our search cluster is over capacity. You should wait a few seconds before
  201. attempting to run your search again.</li>
  202. </ul>
  203.  
  204. <h3 id="availability">Availability</h3>
  205.  
  206. <p>Inline Matching, User-Defined Tagging and System Labels are available starting
  207. today and is included for all customers on our <em>Professional</em>, <em>Enterprise</em> and
  208. <em>Ultimate</em> plans.</p>
  209.  
  210. <p>If you want to learn about urlscan Pro platform and how it might be valuable
  211. for your organisation feel free to reach out to us! We offer free trials with
  212. no strings attached. We would be happy to give you a passionate demo of what
  213. our platform can do for you. Reach out to us at
  214. <a href="mailto:sales@urlscan.io">sales@urlscan.io</a>.</p>]]></content><author><name>Johannes Gilger</name></author><category term="changelog" /><category term="product" /><summary type="html"><![CDATA[As we welcome the year 2024, we wanted to update you on what we have been working on in the second half of 2023 and announce the new features that are launching today. These changes will have a profound impact for our customer workflows and our own detection and classification abilities. Saved Searches — A success story When we launched Saved Searches in 2022 for our scans and hostnames feeds, we did not envision how popular this feature would turn out to be. Initially, Saved Searches were meant as a convenient way to bookmark a search term within the urlscan Pro platform. The Subscriptions feature allowed customers to receive notifications for any new items that matched their Saved Searches. Over the past year, the value of Saved Searches to customers has become abundantly clear. Right now we manage more than 3000 Saved Searches and almost 1000 Subscriptions that have been created by our customers. Our subscription notification system sends out over 5000 emails a day. Saved Searches and Subscriptions became even more important when we launched our Newly Observed Domains &amp; Hostnames Feed in late 2022 and urlscan Observe earlier this year. Since then, many of our customers have set up Saved Searches to look for domains impersonating their brand or targeting their workforce. Our feed captures 2.5 million new domains and hostnames every day, so having an expressive search ability to find and alert on interesting hits is crucial. Today we are launching major improvements for Saved Searches, Subscriptions and collaboration within the urlscan Pro platform.]]></summary></entry><entry><title type="html">Announcing urlscan Observe</title><link href="https://urlscan.io/blog/2023/07/18/launching-urlscan-observe/" rel="alternate" type="text/html" title="Announcing urlscan Observe" /><published>2023-07-18T00:00:00+02:00</published><updated>2023-07-18T00:00:00+02:00</updated><id>https://urlscan.io/blog/2023/07/18/launching-urlscan-observe</id><content type="html" xml:base="https://urlscan.io/blog/2023/07/18/launching-urlscan-observe/"><![CDATA[<p>urlscan.io has always been a powerful tool for scanning and investigating
  215. suspicious websites. Our platform is used by hundreds of customers and tens of
  216. thousands of community users to scan suspicious URLs. Up until now, the
  217. majority of these scans were initiated by customers.</p>
  218.  
  219. <p>Today we are announcing the general availability of <strong>urlscan Observe</strong>, our
  220. new and integrated hands-off monitoring system on the urlscan Pro platform.
  221. urlscan Observe ties together our extensive data collection with our
  222. notification and scanning features to drive fast and automated monitoring of
  223. suspected malicious infrastructure.</p>
  224.  
  225. <p><a href="/blog/assets/images/urlscan-observe1.png">
  226. <img src="/blog/assets/images/urlscan-observe1.png" title="urlscan Observe" />
  227. </a></p>
  228.  
  229. <!--more-->
  230.  
  231. <h3 id="urlscan-observe-the-idea">urlscan Observe: The idea</h3>
  232.  
  233. <p>urlscan Observe aims to fill two gaps in existing automation workflows:</p>
  234.  
  235. <ul>
  236.  <li>Automatically <em>discovering</em> interesting things such as domains, hostnames, IPs, or URLs.</li>
  237.  <li>Automatically <em>monitoring</em> these things for activity and changes.</li>
  238. </ul>
  239.  
  240. <p>Using the example of domains used for phishing and brand impersonation gives a
  241. good overview of the challenges involved. Proactively looking out for
  242. suspicious domains is something that a lot of our customers are already doing,
  243. and there are a variety of commercial and Open Source tools available for
  244. surfacing these. The easiest way to get started would be to start monitoring
  245. free data sources like <a href="https://certstream.calidog.io/">certstream</a> for TLS
  246. hostnames of interest. While finding suspicious domains might be relatively
  247. easy, the hard part is what happens next: Monitoring these domains to see if
  248. they ever go <em>live</em>.</p>
  249.  
  250. <p>We have built urlscan Observe to actively <em>monitor</em> observables, scan them
  251. using our web scanning engine, resolve DNS records, capture any observations,
  252. alert you about major changes and show the whole timeline in a dedicated UI.</p>
  253.  
  254. <h3 id="monitoring-example-suspicious-domains">Monitoring Example: Suspicious domains</h3>
  255.  
  256. <p>To understand the different stages in the lifecycle of a piece of
  257. infrastructure, let’s have a look at a fictional albeit common timeline of a
  258. suspicious domain and what we can observe about it:</p>
  259.  
  260. <ul>
  261.  <li><strong>1:12am</strong> We observe the domain for the first time in a DNS zonefile. It does not have any DNS A/AAAA records yet.</li>
  262.  <li><strong>2:40pm</strong> The domain starts to resolve to an IPv4 address.</li>
  263.  <li><strong>2:45pm</strong> We observe the first TLS certificate for this domain in a Certificate Transparency (CT) log.</li>
  264.  <li><strong>2:55pm</strong> The domain starts responding to HTTP requests. It only carries an empty landing page.</li>
  265.  <li><strong>3:12pm</strong> The domain starts serving a directory index listing on HTTP.</li>
  266.  <li><strong>3:36pm</strong> The domain starts serving a phishing site via HTTP.</li>
  267.  <li><strong>6:15pm</strong> The domain has deactivated by the hosting company and is now serving an empty placeholder page.</li>
  268.  <li><strong>8:50pm</strong> The domain stops resolving via DNS.</li>
  269. </ul>
  270.  
  271. <p>This timeline of events tells the story of when and how the malicious domain
  272. was first set up, how quickly it went live and how long it took until it was
  273. taken down again. At every step of this lifecycle one has to monitor the domain
  274. (DNS, HTTP, TLS certs) and compare observations with previous time intervals to
  275. figure out if anything about the domain has recently changed. This is exactly
  276. what urlscan Observe will automatically do for you going forward.</p>
  277.  
  278. <h3 id="urlscan-observe-workflow">urlscan Observe workflow</h3>
  279.  
  280. <p>urlscan Observe monitors <em>Observables</em> such as hostnames, domains, IPs and URLs
  281. as part of <em>Incidents</em> within urlscan Pro. There are two ways to create these
  282. incidents:</p>
  283.  
  284. <ul>
  285.  <li>You can manually create an incident by supplying your own hostname, domain, IP, or URL.</li>
  286.  <li>You can set up <em>Saved Searches</em> and a <em>Subscription</em> within urlscan Pro to automatically create incidents for new observables.</li>
  287. </ul>
  288.  
  289. <div class="row bottom10">
  290. <div class="col col-md-6">
  291. <a href="/blog/assets/images/urlscan-observe3.png">
  292. <img src="/blog/assets/images/urlscan-observe3.png" title="urlscan Observe" />
  293. </a>
  294. </div>
  295. <div class="col col-md-6">
  296. <a href="/blog/assets/images/urlscan-observe2.png">
  297. <img src="/blog/assets/images/urlscan-observe2.png" title="urlscan Observe" />
  298. </a>
  299. </div>
  300. </div>
  301.  
  302. <p>Using Saved Searches is a really powerful tool because you can write a query
  303. that matches interesting observables in our <a href="/blog/2022/11/22/newly-observed-domains-and-hostnames/"><strong>Real-Time Newly Observed
  304. Hostnames &amp; Domains
  305. Feed</strong></a>. This feed
  306. captures hundreds of thousands of new domains and millions of new unique
  307. hostnames every day. Using our Search API you can write a query that matches
  308. hostnames of interest, either by strings within the hostname or by certain
  309. infrastructure attributes such as NS, MX or other DNS records.</p>
  310.  
  311. <p><a href="/blog/assets/images/urlscan-observe5.png">
  312. <img src="/blog/assets/images/urlscan-observe5.png" title="urlscan Observe" />
  313. </a></p>
  314.  
  315. <p>You can work with incidents from within the urlscan Pro UI, but you can also
  316. set up <em>Alerting Channels</em> to be notified whenever there are new incidents and
  317. changes to existing incidents. As part of urlscan Observe we have overhauled
  318. our notification system and can now send out notifications via E-Mail and
  319. Webhooks.</p>
  320.  
  321. <h3 id="availability">Availability</h3>
  322.  
  323. <p>urlscan Observe is available starting today and is included for all customers
  324. on our <em>Professional</em>, <em>Enterprise</em> and <em>Ultimate</em> plans.</p>
  325.  
  326. <p>If you want to learn about urlscan Pro platform and how it might be valuable
  327. for your organisation feel free to reach out to us!  We offer free trials with
  328. no strings attached. We would be happy to give you a passionate demo of what
  329. our platform can do for you. Reach out to us at
  330. <a href="mailto:sales@urlscan.io">sales@urlscan.io</a>.</p>]]></content><author><name>Johannes Gilger</name></author><category term="changelog" /><category term="product" /><summary type="html"><![CDATA[urlscan.io has always been a powerful tool for scanning and investigating suspicious websites. Our platform is used by hundreds of customers and tens of thousands of community users to scan suspicious URLs. Up until now, the majority of these scans were initiated by customers. Today we are announcing the general availability of urlscan Observe, our new and integrated hands-off monitoring system on the urlscan Pro platform. urlscan Observe ties together our extensive data collection with our notification and scanning features to drive fast and automated monitoring of suspected malicious infrastructure.]]></summary></entry><entry><title type="html">2022 year in review and new products launching in 2023</title><link href="https://urlscan.io/blog/2023/01/12/2022-year-in-review-and-new-products-launching-in-2023/" rel="alternate" type="text/html" title="2022 year in review and new products launching in 2023" /><published>2023-01-12T00:00:00+01:00</published><updated>2023-01-12T00:00:00+01:00</updated><id>https://urlscan.io/blog/2023/01/12/2022-year-in-review-and-new-products-launching-in-2023</id><content type="html" xml:base="https://urlscan.io/blog/2023/01/12/2022-year-in-review-and-new-products-launching-in-2023/"><![CDATA[<p>If you’re not sick of hearing it yet: Here’s to a happy new year from all of us
  331. at urlscan.io!</p>
  332.  
  333. <p>We wanted to take the opportunity to revisit major changes that launched in
  334. 2022 and to give you a glimpse of our 2023 roadmap at the same time. Some of
  335. the things we have worked on in 2022 represent the foundation for new products
  336. due to launch over the next quarters.</p>
  337.  
  338. <!--more-->
  339.  
  340. <h3 id="may-2022-visual-search">May 2022: Visual Search</h3>
  341.  
  342. <p>In May of 2022 we announced the cutting-edge <a href="/blog/2022/05/02/visual-search/"><strong>Visual
  343. Search</strong></a> feature within urlscan Pro. By
  344. itself, this feature allows customers to hunt for scans of websites using only
  345. the visual appearance of their screenshot. urlscan itself uses the visual
  346. similarity of a website to improve our brand and phishing detection. The fact
  347. that we are able to offer this feature at our scale and with a meaningful
  348. accuracy means that we and our customers are now able to detect and attribute
  349. phishing pages using <em>Visual Similarity</em> instead of only relying on threshold
  350. or rule-based approaches which work on text or HTTP content of websites.</p>
  351.  
  352. <h3 id="july-2022-saved-searches--subscriptions">July 2022: Saved Searches &amp; Subscriptions</h3>
  353.  
  354. <p>In July we launched a frequently-requested feature in the form
  355. of <a href="/blog/2022/07/11/urlscan-pro-product-updates-for-q2-2022/"><strong>Saved Searches &amp; Email
  356. Subscriptions</strong></a>.
  357. This feature lets customers <em>bookmark</em> interesting hunt searches from within the
  358. urlscan Pro UI. Furthermore, customers are now able to <em>subscribe</em> to email
  359. alerts whenever there are new hits for their searches.</p>
  360.  
  361. <h3 id="november-2022-newly-observed-hostnames--domains">November 2022: Newly Observed Hostnames &amp; Domains</h3>
  362.  
  363. <p>In November of last year we launched the real-time <a href="/blog/2022/11/22/newly-observed-domains-and-hostnames/"><strong>Newly Observed Hostnames
  364. &amp; Domains</strong></a> datasource
  365. in urlscan Pro. As we said back then, this new feed of data means a great deal
  366. for both urlscan and our customers as it will enable multiple use-cases going
  367. forward. Previously we had to rely on someone submitting a URL to urlscan.io in
  368. order for us to know about the URL and its hostname. Now, we ingest millions of
  369. hostnames and domains every day from a variety of sources, and we can observe
  370. hostnames long before they are starting to host website content.</p>
  371.  
  372. <p>Access to the data is available through urlscan Pro, and our customers are
  373. already using it to identify potentially malicious domains and hostnames (e.g.
  374. typosquatting) but also to look at their own infrastructure, for example via
  375. subdomain enumeration.</p>
  376.  
  377. <h2 id="launching-in-2023-urlscan-observe">Launching in 2023: urlscan <em>Observe</em></h2>
  378.  
  379. <p>For the next year, the big theme for us will be combining the building blocks
  380. we already have into a more powerful and more convenient hands-off pipeline. A
  381. big recurring ask from our customers is a set-and-forget system that lets them
  382. enter keywords and search expressions and hand off the responsibility for
  383. monitoring any hits to urlscan.</p>
  384.  
  385. <p>That’s why, starting in Q2 of 2023, we will be offering a new module called
  386. <em>urlscan Observe</em>. urlscan Observe will be our hands-off monitoring system for new
  387. domains, hostnames, and URLs. It will perform the onerous job of watching for
  388. <em>changes</em> to interesting <em>things</em> (like domains and URLs) over <em>time</em>. The end
  389. result for customers is a concise timeline of changes to the <em>thing</em> being
  390. monitored along with frequent alerts.</p>
  391.  
  392. <p>In a nut-shell, urlscan <em>Observe</em> will support:</p>
  393. <ul>
  394.  <li>Customer-supplied keywords and search patterns to match against the newly observed domains &amp; hostnames feed</li>
  395.  <li>Sending customers alerts about new hits for these keywords</li>
  396.  <li>Automatically monitoring for changes related to these hits</li>
  397.  <li>Alerting customers about infrastructure changes to these hits: DNS records, new subdomains, active HTTP servers, website content, etc.</li>
  398.  <li>Letting customers manually specify domains and URLs to watch</li>
  399. </ul>
  400.  
  401. <h3 id="new-subscription-options">New Subscription Options</h3>
  402.  
  403. <p>For 2023 we have also overhauled <a href="https://urlscan.io/pricing/"><strong>our plans &amp;
  404. pricing</strong></a>. Going forward we have one subscription
  405. aimed at customers that are only looking for an automation capability, and we
  406. have three subscription levels for customers that also want to leverage our
  407. powerful urlscan Pro - Threat Hunting platform. We continue to work with
  408. <a href="https://urlscan.io/partners/">various resellers across the globe</a> to
  409. facilitate fast and painless procurement processes.</p>
  410.  
  411. <h3 id="talk-to-us">Talk to us!</h3>
  412.  
  413. <p>If you’re interested to learn about our current and upcoming capabilities and
  414. how these might help you automate and reduce your workload, talk to us. For all
  415. of our plans and features we offer a free 30-day trial with no strings
  416. attached. We’d be happy to give you a passionate demo of what our platform can
  417. do for you. Reach out to us at <a href="mailto:sales@urlscan.io">sales@urlscan.io</a>.</p>]]></content><author><name>Johannes Gilger</name></author><category term="changelog" /><summary type="html"><![CDATA[If you’re not sick of hearing it yet: Here’s to a happy new year from all of us at urlscan.io! We wanted to take the opportunity to revisit major changes that launched in 2022 and to give you a glimpse of our 2023 roadmap at the same time. Some of the things we have worked on in 2022 represent the foundation for new products due to launch over the next quarters.]]></summary></entry><entry><title type="html">urlscan Pro - Newly observed domains &amp;amp; hostnames</title><link href="https://urlscan.io/blog/2022/11/22/newly-observed-domains-and-hostnames/" rel="alternate" type="text/html" title="urlscan Pro - Newly observed domains &amp;amp; hostnames" /><published>2022-11-22T00:00:00+01:00</published><updated>2022-11-22T00:00:00+01:00</updated><id>https://urlscan.io/blog/2022/11/22/newly-observed-domains-and-hostnames</id><content type="html" xml:base="https://urlscan.io/blog/2022/11/22/newly-observed-domains-and-hostnames/"><![CDATA[<p>Today we are officially launching our <strong>real-time feed and search index of newly
  418. observed hostnames and domains</strong> on urlscan Pro. This is a huge step forward
  419. since it will allow customers to proactively look for new domains and hostnames
  420. that might be of interest to them, even if these hostnames were not previously
  421. scanned as a full-blown website through urlscan.io.</p>
  422.  
  423. <div class="row">
  424. <div class="col col-md-12">
  425. <img src="/blog/assets/images/hostnames-search.png" title="urlscan Pro - Newly observed domains &amp; hostnames" />
  426. </div>
  427. </div>
  428.  
  429. <!--more-->
  430.  
  431. <p>The <em>hostnames</em> data source tracks newly
  432. observed domains and hostnames sourced from multiple data sources including
  433. Certificate Transparency, passive DNS monitoring, existing scan data,
  434. zonefiles, etc. This data source lets customers:</p>
  435.  
  436. <ul>
  437.  <li>Search for new domains and hostnames even if they were not scanned on urlscan.io.</li>
  438.  <li>Search for new domains and hostnames using keywords.</li>
  439.  <li>Search for new domains based on infrastructure (such as A, MX, or NS records).</li>
  440.  <li>Enumerate subdomains and hostnames for specific domains.</li>
  441.  <li>Enumerate hostnames / domains on a specific IP address.</li>
  442.  <li>Perform reconnaissance and lightweight attack surface discovery.</li>
  443.  <li>Export results via an API call or as a CSV file via the UI.</li>
  444.  <li>Create saved searches and email alerts. (Q1 / 2023)</li>
  445. </ul>
  446.  
  447. <p>Additionally, the <em>certificates</em> data source contains TLS certificates that have
  448. been collected from various Certificate Transparency (CT) logs. With these
  449. additions we are bringing in more of the discovery, alerting and analysis
  450. workflow into urlscan Pro that customers previously had to combine using
  451. auxiliary tools and platforms.</p>
  452.  
  453. <h3 id="hostnames--domains-by-the-numbers">Hostnames &amp; Domains by the numbers</h3>
  454.  
  455. <p>Currently our <em>hostnames</em> data sources contains <strong>1.5 billion unique
  456. hostnames</strong> across <strong>310 million unique registered domains</strong>. More than half of
  457. these domains already have a DNS resolution attached to them.</p>
  458.  
  459. <p>The <em>certificates</em> datasource contains <strong>more than 5 billion TLS certificates</strong>
  460. issued through Certificate Transparency logs.</p>
  461.  
  462. <p>Both of these data source operate in <em>real time</em>, with new records taking no
  463. more than a minute before they appear in the search results.</p>
  464.  
  465. <h3 id="next-steps">Next Steps</h3>
  466.  
  467. <p>Making these data sources available for searching is just the first step. Over
  468. the next year, these new additions will come together to form our fully
  469. automated hands-off monitoring pipeline. This new pipeline will allow our
  470. customers to set up keywords of interest within urlscan Pro and we will take care of the rest:</p>
  471.  
  472. <ul>
  473.  <li>We will watch for and alert on newly observed hostnames matching these keywords.</li>
  474.  <li>We will try to resolve these hostnames via DNS until they have a DNS record.</li>
  475.  <li>We will scan for potential websites on these hosts and alert customers when a site goes live.</li>
  476.  <li>We will monitor the page content for changes, such as when a page goes from landing page to fully set up website.</li>
  477. </ul>
  478.  
  479. <p>Furthermore these data sources will be more tightly integrated into the urlscan
  480. Pro platform. We will add the same set of convenience features customers have
  481. been used to, including <em>Saved Searches</em> and <em>E-Mail Subscriptions</em> for new
  482. hits.</p>
  483.  
  484. <h3 id="pricing--availability">Pricing &amp; Availability</h3>
  485.  
  486. <p>Access to these data sources and the upcoming alerting pipeline is free of
  487. charge for customers on active subscriptions. For new customers and
  488. subscription renewals customers can choose to add this functionality as a
  489. separate subscription module.</p>]]></content><author><name>Johannes Gilger</name></author><category term="product" /><category term="announcement" /><summary type="html"><![CDATA[Today we are officially launching our real-time feed and search index of newly observed hostnames and domains on urlscan Pro. This is a huge step forward since it will allow customers to proactively look for new domains and hostnames that might be of interest to them, even if these hostnames were not previously scanned as a full-blown website through urlscan.io.]]></summary></entry><entry><title type="html">Internet-Wide IPv4 Scan Data</title><link href="https://urlscan.io/blog/2022/09/28/internet-wide-scans/" rel="alternate" type="text/html" title="Internet-Wide IPv4 Scan Data" /><published>2022-09-28T00:00:00+02:00</published><updated>2022-09-28T00:00:00+02:00</updated><id>https://urlscan.io/blog/2022/09/28/internet-wide-scans</id><content type="html" xml:base="https://urlscan.io/blog/2022/09/28/internet-wide-scans/"><![CDATA[<p>We are now offering raw download access to the following datasets to interested customers:</p>
  490. <ul>
  491.  <li>Weekly Internet-wide scans of the whole IPv4 space on ports tcp/80 and tcp/443.</li>
  492.  <li>JSON output containing TLS certificates and HTTP responses.</li>
  493.  <li>More than 200GB of compressed raw data available per week.</li>
  494.  <li>More than 40 million HTTP responses on tcp/80 and more than 35 million on tcp/443.</li>
  495. </ul>
  496.  
  497. <!--more-->
  498.  
  499. <p>The HTTP scan results contain the following pieces of data and metadata:</p>
  500. <ul>
  501.  <li>Observation timestamp</li>
  502.  <li>GeoIP and ASN network annotation for the IP which was scanned</li>
  503.  <li>HTTP Status code &amp; string</li>
  504.  <li>HTTP Response Headers &amp; content</li>
  505.  <li>Response Body Size &amp; SHA256</li>
  506.  <li>Response Body (up to 64kB)</li>
  507.  <li>TLS Certificate Subject, Issuer, Serial Number and MD5/SHA1/SHA256 fingerprints</li>
  508. </ul>
  509.  
  510. <p>The certificate scan results contain certificates that responded to HTTPS requests:</p>
  511. <ul>
  512.  <li>Raw unparsed TLS certificate data</li>
  513.  <li>Parsed certificate data</li>
  514. </ul>
  515.  
  516. <h3 id="sample-data">Sample Data</h3>
  517. <p>These samples are 10,000 lines from the respective datasets and should give you an idea what’s contained within the data.</p>
  518.  
  519. <h4 id="http-scans-on-tcp80---http-responses">HTTP Scans on tcp/80 - HTTP Responses</h4>
  520. <ul>
  521.  <li><a href="https://urlscan.io/share/internetscan-samples/small-sample-slash80.json" target="_blank">Preview 10 result samples</a></li>
  522.  <li><a href="https://urlscan.io/share/internetscan-samples/sample-slash80.json.gz">Download 10k result samples (Gzipped)</a></li>
  523. </ul>
  524.  
  525. <h4 id="http-scans-on-tcp443---http-responses">HTTP Scans on tcp/443 - HTTP Responses</h4>
  526. <ul>
  527.  <li><a href="https://urlscan.io/share/internetscan-samples/small-sample-slash443.json" target="_blank">Preview 10 result samples</a></li>
  528.  <li><a href="https://urlscan.io/share/internetscan-samples/sample-slash443.json.gz">Download 10k result samples (Gzipped)</a></li>
  529. </ul>
  530.  
  531. <h4 id="http-scans-on-tcp443---tls-certificates">HTTP Scans on tcp/443 - TLS Certificates</h4>
  532. <ul>
  533.  <li><a href="https://urlscan.io/share/internetscan-samples/small-sample-slash443-certs.json" target="_blank">Preview 10 result samples</a></li>
  534.  <li><a href="https://urlscan.io/share/internetscan-samples/sample-slash443-certs.json.gz">Download 10k result samples (Gzipped)</a></li>
  535. </ul>
  536.  
  537. <h3 id="access-to-data">Access to Data</h3>
  538. <p>If you are interested in access to these data-sets then please reach out to us at support@urlscan.io. We offer free 30-day trial access.</p>]]></content><author><name>Johannes Gilger</name></author><category term="product" /><category term="announcement" /><summary type="html"><![CDATA[We are now offering raw download access to the following datasets to interested customers: Weekly Internet-wide scans of the whole IPv4 space on ports tcp/80 and tcp/443. JSON output containing TLS certificates and HTTP responses. More than 200GB of compressed raw data available per week. More than 40 million HTTP responses on tcp/80 and more than 35 million on tcp/443.]]></summary></entry><entry><title type="html">Scan Visibility Best Practices</title><link href="https://urlscan.io/blog/2022/07/27/scan-visibility-best-practices/" rel="alternate" type="text/html" title="Scan Visibility Best Practices" /><published>2022-07-27T00:00:00+02:00</published><updated>2022-07-27T00:00:00+02:00</updated><id>https://urlscan.io/blog/2022/07/27/scan-visibility-best-practices</id><content type="html" xml:base="https://urlscan.io/blog/2022/07/27/scan-visibility-best-practices/"><![CDATA[<p>This post talks about the different scan visibilities available on urlscan.io,
  539. which visibility you should use for different purposes and how to review your
  540. submission results on urlscan.io to detect and prevent inadvertent information
  541. leaks.</p>
  542.  
  543. <p><strong>tl;dr</strong>: Understand the different scan visibilities, review your own scans for non-public
  544. information, review your automated submission workflows, enforce a maximum scan
  545. visibility for your account and work with us to clean non-public data from urlscan.io!</p>
  546.  
  547. <!--more-->
  548.  
  549. <h3 id="scan-visibilities---introduction">Scan Visibilities - Introduction</h3>
  550.  
  551. <p>Every time you submit a URL to urlscan.io you can select the <em>visibility</em> for
  552. the scan result. The visibility controls which parties will be able to see the
  553. URL you submitted and retrieve the scan results.</p>
  554.  
  555. <ul>
  556.  <li><strong>Public</strong> means that the scan will be visible on the front page and in the
  557. public search results and info pages. It will be visible to any visitor on
  558. urlscan.io and search engines as well.<br />
  559. You should only use <em>Public</em> scans if there are no concerns that the URLs you are
  560. submitting contain any personal or proprietary information, either in the URL
  561. itself or in the content of the page. This could be because you sourced these
  562. URLs from another public data set, or because you discovered these URLs
  563. yourself via crawling or keyword monitoring.</li>
  564.  <li><strong>Unlisted</strong> means that the scan will <em>not</em> be visible on the public page or
  565. search results, but will be visible to customers of the urlscan Pro platform.
  566. We only admit customers to urlscan Pro which are either vetted security
  567. researchers or reputable corporations.<br />
  568. You should use <em>Unlisted</em> scans if you think that there might be personal or
  569. proprietary information within the websites, but you still want to document
  570. the URLs to the audience of urlscan Pro so that they can take action
  571. accordingly (automated takedowns, research, improving their products).</li>
  572.  <li><strong>Private</strong> means that the scan will only be visible to yourself and not to
  573. any of our customers or partners. If you are part of a team account and have
  574. the team set as “Active”, then your private scans will also be visible to
  575. other team members on that team account.<br />
  576. You should use <em>Private</em> scans if you don’t want to share the scans you
  577. perform with anyone else. The downside is that unique URLs that you might
  578. submit will not be seen by anyone else, and potential malicious activity
  579. might go unnoticed by the community and downstream security companies on the
  580. urlscan Pro platform.</li>
  581. </ul>
  582.  
  583. <p>There are different reason for choosing specific visibility levels, and picking
  584. the right one very much depends on source of the data you are analysing.
  585. Customers might have different streams of URLs they want to analyses with
  586. urlscan.io, and each stream might have its unique set of privacy
  587. considerations.</p>
  588.  
  589. <p>We encourage user to use <em>Public</em> or <em>Unlisted</em> scans whenever possible since
  590. it helps the whole security community keep track and understand threats rather
  591. than siloing that information. But we understand that there are use-cases which
  592. don’t allow anything but the <em>Private</em> visibility level.</p>
  593.  
  594. <h3 id="reviewing-your-submission-results">Reviewing your submission results</h3>
  595.  
  596. <p>Whether you use urlscan.io via the UI or the API you should frequently review
  597. your submission results to ensure that you are not submitting inappropriate
  598. URLs at any visibility level. To see a list of your own submissions first make
  599. sure you are logged in to urlscan.io before executing the following searches:</p>
  600.  
  601. <ul>
  602.  <li><a href="https://urlscan.io/search/#user%3Ame%20OR%20team%3Ame">Search: <strong>All Scans</strong> submitted by yourself and your teams</a></li>
  603.  <li><a href="https://urlscan.io/search/#(user%3Ame%20OR%20team%3Ame)%20AND%20task.visibility%3Apublic">Search: <strong>Public Scans</strong> submitted by yourself and your teams</a></li>
  604. </ul>
  605.  
  606. <p>Make sure you understand <em>where</em> submissions are originating from: This might
  607. be your employees or your automated tools such as SOAR platforms! You should
  608. watch out for submissions for the following types of websites:</p>
  609.  
  610. <ul>
  611.  <li>Hosted invoice pages</li>
  612.  <li>DocuSign or other document signing requests</li>
  613.  <li>Google Drive / Dropbox links</li>
  614.  <li>Email unsubscribe links</li>
  615.  <li>Password reset or create links</li>
  616.  <li>Web service, meeting and conference invite links</li>
  617.  <li>URLs including PII (email addresses) or API keys</li>
  618. </ul>
  619.  
  620. <h3 id="setting-and-enforcing-a-default-visibility">Setting and enforcing a default visibility</h3>
  621.  
  622. <p>urlscan.io allows you to set a default visibility and even to enforce this as
  623. the maximum visibility for all future scans. Both settings can be found in your <em>Settings</em> window on your user dashboard.</p>
  624.  
  625. <p>Team account owners can change these settings team-wide and have them be
  626. applied to every active team member. This is done on the <em>Settings</em> page for
  627. the team account.</p>
  628.  
  629. <div class="row">
  630. <div class="col col-md-6">
  631. <img src="/blog/assets/images/scan-visibility.png" title="urlscan Scan Visiblity" />
  632. <p class="help-block">The scan visiblity settings dialog in your user dashboard</p>
  633. </div>
  634. </div>
  635.  
  636. <h3 id="what-we-are-doing-to-prevent-information-leaks">What we are doing to prevent information leaks</h3>
  637.  
  638. <p>We are aware of the fact that non-public information is being scanned on
  639. urlscan.io and are taking a number of steps to mitigate this issue.</p>
  640.  
  641. <ul>
  642.  <li>We have domain and URL pattern blocklists in place which prevent scanning of certain websites.</li>
  643.  <li>We have deletion rules in place which delete past and future scans for certain keywords and patterns.</li>
  644.  <li>We have recently made the <em>Scan Visibility</em> setting in our user dashboard more visible and easier to understand.</li>
  645.  <li>We have reached out to customers who we identified as submitting a significant amount of Public scans.</li>
  646.  <li>We allow immediate takedown of single scans via the <em>Report</em> button on each scan page.</li>
  647.  <li>We work with customers and third parties to facilitate bulk-delete via our deletion rules.</li>
  648.  <li>We are reviewing popular third-party integrations such as SOAR tools to ensure they respect the user intent with regards to visibility.</li>
  649. </ul>
  650.  
  651. <h3 id="security-researchers">Security Researchers</h3>
  652.  
  653. <p>If you are a <em>security researcher</em> and have discovered a large number of scans
  654. with non-public information we would ask that you reach out to
  655. <a href="mailto:security@urlscan.io">security@urlscan.io</a> and work with us to get the
  656. offending scans removed and to investigate the source of these scans.</p>]]></content><author><name>Johannes Gilger</name></author><category term="knowledge" /><summary type="html"><![CDATA[This post talks about the different scan visibilities available on urlscan.io, which visibility you should use for different purposes and how to review your submission results on urlscan.io to detect and prevent inadvertent information leaks. tl;dr: Understand the different scan visibilities, review your own scans for non-public information, review your automated submission workflows, enforce a maximum scan visibility for your account and work with us to clean non-public data from urlscan.io!]]></summary></entry><entry><title type="html">urlscan Pro - Product Updates for Q2 / 2022</title><link href="https://urlscan.io/blog/2022/07/11/urlscan-pro-product-updates-for-q2-2022/" rel="alternate" type="text/html" title="urlscan Pro - Product Updates for Q2 / 2022" /><published>2022-07-11T00:00:00+02:00</published><updated>2022-07-11T00:00:00+02:00</updated><id>https://urlscan.io/blog/2022/07/11/urlscan-pro-product-updates-for-q2-2022</id><content type="html" xml:base="https://urlscan.io/blog/2022/07/11/urlscan-pro-product-updates-for-q2-2022/"><![CDATA[<p>Today marks the last day of major features releases we had planned for Q2. This
  657. post will cover the highlights of new functionality in our urlscan Pro
  658. platform.</p>
  659.  
  660. <h4 id="saved-searches--subscriptions">Saved Searches &amp; Subscriptions</h4>
  661.  
  662. <p>You can now save a search in urlscan Pro to be able to run it again later. On
  663. top of that, you can also receive an email alert whenever there are new hits
  664. for your saved searches. This allows you to create a number of hunt queries
  665. which might only trigger occasionally and automatically receive notifications
  666. when there are new hits.</p>
  667.  
  668. <div class="row">
  669. <div class="col col-md-7">
  670. <img src="/blog/assets/2022-07-11-urlscan-pro-product-updates-for-q2-2022/urlscanpro-searches.png" title="Saved Searches" />
  671. </div>
  672. <div class="col col-md-5">
  673. <img src="/blog/assets/2022-07-11-urlscan-pro-product-updates-for-q2-2022/urlscanpro-email-alert.png" title="Saved Searches" />
  674. </div>
  675. </div>
  676.  
  677. <h4 id="search-ui-improvements">Search UI Improvements</h4>
  678. <p>The Search page in urlscan Pro was significantly improved:</p>
  679. <ul>
  680.  <li><em>Filters</em> are a convenient way to add, remove and invert pre-defined common
  681. search filters. We have a list of pre-defined filters that you can work with.</li>
  682.  <li>In the Search view, the new <em>Aggregations</em> list shows you aggregate
  683. information from your search results and allows you to further filter your
  684. results by adding another facet to your filters.</li>
  685.  <li>The new quick filter dialog also contains completions for the <em>Brand Names</em>
  686. that we track in urlscan Pro.</li>
  687.  <li>Whenever you have created an interesting search, you can now save it as a
  688. <em>Saved Search</em> directly from the search UI.</li>
  689.  <li>You can also use the new <em>CSV Export</em> feature to retrieve the results as a
  690. CSV file.</li>
  691. </ul>
  692.  
  693. <h4 id="file-downloads">File Downloads</h4>
  694. <p>In the process of scanning websites, urlscan.io will sometimes encounter file
  695. downloads triggered by the website. If we are able to successfully download the
  696. file, we will store it, hash it and make it available for downloading by our
  697. customers.</p>
  698.  
  699. <p>To highlight this stream of data, we have created a separate <em>Downloads</em> section
  700. which contains the most recent file downloads and highlights the information we
  701. store for each downloaded file. There is a dedicated Help Section on
  702. Downloads which talks about API use and known limitations of this feature.</p>
  703.  
  704. <h4 id="live-scanning">Live Scanning</h4>
  705. <p>The following features were added to the the Live Scanning UI:</p>
  706. <ul>
  707.  <li>Additional devices available for device emulation (iPhone 12, 13, etc)</li>
  708.  <li>Scanners can be selected via a new Select All button</li>
  709.  <li>Scanner Details can be shown, such as the current exit IP, AS and VPN provider</li>
  710.  <li>Scan Results have been cleaned up to give a better overview</li>
  711.  <li>Outgoing Links can now be scanned with a dedicated button</li>
  712.  <li>Available Live Scan Quotas are shown within the scanning UI</li>
  713. </ul>
  714.  
  715. <h3 id="urlscan-pro-trial">urlscan Pro Trial</h3>
  716.  
  717. <p>If you would like to take <strong>urlscan Pro</strong> for a spin just reach out to
  718. <a href="mailto:sales@urlscan.io">sales@urlscan.io</a>. We offer 30-day free trials with
  719. no strings attached.</p>]]></content><author><name>Johannes Gilger</name></author><category term="changelog" /><category term="product" /><summary type="html"><![CDATA[Today marks the last day of major features releases we had planned for Q2. This post will cover the highlights of new functionality in our urlscan Pro platform. Saved Searches &amp; Subscriptions You can now save a search in urlscan Pro to be able to run it again later. On top of that, you can also receive an email alert whenever there are new hits for your saved searches. This allows you to create a number of hunt queries which might only trigger occasionally and automatically receive notifications when there are new hits. Search UI Improvements The Search page in urlscan Pro was significantly improved: Filters are a convenient way to add, remove and invert pre-defined common search filters. We have a list of pre-defined filters that you can work with. In the Search view, the new Aggregations list shows you aggregate information from your search results and allows you to further filter your results by adding another facet to your filters. The new quick filter dialog also contains completions for the Brand Names that we track in urlscan Pro. Whenever you have created an interesting search, you can now save it as a Saved Search directly from the search UI. You can also use the new CSV Export feature to retrieve the results as a CSV file. File Downloads In the process of scanning websites, urlscan.io will sometimes encounter file downloads triggered by the website. If we are able to successfully download the file, we will store it, hash it and make it available for downloading by our customers. To highlight this stream of data, we have created a separate Downloads section which contains the most recent file downloads and highlights the information we store for each downloaded file. There is a dedicated Help Section on Downloads which talks about API use and known limitations of this feature. Live Scanning The following features were added to the the Live Scanning UI: Additional devices available for device emulation (iPhone 12, 13, etc) Scanners can be selected via a new Select All button Scanner Details can be shown, such as the current exit IP, AS and VPN provider Scan Results have been cleaned up to give a better overview Outgoing Links can now be scanned with a dedicated button Available Live Scan Quotas are shown within the scanning UI urlscan Pro Trial If you would like to take urlscan Pro for a spin just reach out to sales@urlscan.io. We offer 30-day free trials with no strings attached.]]></summary></entry><entry><title type="html">Visual Search and Live Scanning APIs GA</title><link href="https://urlscan.io/blog/2022/05/30/visual-search-livescan-api/" rel="alternate" type="text/html" title="Visual Search and Live Scanning APIs GA" /><published>2022-05-30T00:00:00+02:00</published><updated>2022-05-30T00:00:00+02:00</updated><id>https://urlscan.io/blog/2022/05/30/visual-search-livescan-api</id><content type="html" xml:base="https://urlscan.io/blog/2022/05/30/visual-search-livescan-api/"><![CDATA[<p>As of today, our <strong>Live Scanning</strong> and <strong>Visual Search</strong> features are no longer
  720. considered <em>Beta</em>. The APIs for these features are now stable and can be used
  721. in production use-cases. Customers on our <em>Professional</em> and <em>Enterprise</em>
  722. subscription tiers will find API documentation for these features in the
  723. urlscan Pro platform.</p>
  724.  
  725. <h4 id="visual-search">Visual Search</h4>
  726. <p>Visual Search allows users to find historical scans with visually similar
  727. screenshots to a scan of interest. This type of feature is also called
  728. Content-Based Image Retrieval. Check out the <a href="/blog/2022/05/02/visual-search/">accompanying
  729. blog-post</a> to learn more.</p>
  730.  
  731. <h4 id="live-scanning">Live Scanning</h4>
  732. <p>Live Scanning allows you to scan websites quickly, from different locations,
  733. and with different browser options. Scan results are not automatically saved to
  734. urlscan.io, but you can use Store Scan if you want to archive a particular scan
  735. result.</p>
  736.  
  737. <p>Live Scanning is a very versatile capability that can be used for a number of
  738. common scenarios, including <em>Reconnaisance</em>, <em>Change Monitoring</em> and <em>Remote
  739. File Retrieval</em>.</p>
  740.  
  741. <h3 id="urlscan-pro-trial">urlscan Pro Trial</h3>
  742.  
  743. <p>If you would like to take <strong>urlscan Pro</strong> for a spin just reach out to
  744. <a href="mailto:sales@urlscan.io">sales@urlscan.io</a>. We offer 30-day free trials with
  745. no strings attached.</p>]]></content><author><name>Johannes Gilger</name></author><category term="changelog" /><category term="product" /><category term="api" /><summary type="html"><![CDATA[As of today, our Live Scanning and Visual Search features are no longer considered Beta. The APIs for these features are now stable and can be used in production use-cases. Customers on our Professional and Enterprise subscription tiers will find API documentation for these features in the urlscan Pro platform. Visual Search Visual Search allows users to find historical scans with visually similar screenshots to a scan of interest. This type of feature is also called Content-Based Image Retrieval. Check out the accompanying blog-post to learn more. Live Scanning Live Scanning allows you to scan websites quickly, from different locations, and with different browser options. Scan results are not automatically saved to urlscan.io, but you can use Store Scan if you want to archive a particular scan result. Live Scanning is a very versatile capability that can be used for a number of common scenarios, including Reconnaisance, Change Monitoring and Remote File Retrieval. urlscan Pro Trial If you would like to take urlscan Pro for a spin just reach out to sales@urlscan.io. We offer 30-day free trials with no strings attached.]]></summary></entry><entry><title type="html">Visual Search</title><link href="https://urlscan.io/blog/2022/05/02/visual-search/" rel="alternate" type="text/html" title="Visual Search" /><published>2022-05-02T00:00:00+02:00</published><updated>2022-05-02T00:00:00+02:00</updated><id>https://urlscan.io/blog/2022/05/02/visual-search</id><content type="html" xml:base="https://urlscan.io/blog/2022/05/02/visual-search/"><![CDATA[<p>Today we are launching <strong>Visual Search</strong> which is a powerful new search feature
  746. available through our <strong>urlscan Pro - Threat Hunting</strong> platform.</p>
  747.  
  748. <p><img src="/blog/assets/images/visualsearch.png" alt="urlscan Pro - Visual Search" /></p>
  749.  
  750. <h3 id="use-cases">Use-Cases</h3>
  751. <p>Visual Search allows users to find historical scans with visually similar
  752. screenshots to a scan of interest. This type of feature is also called
  753. <em>Content-Based Image Retrieval</em>. Instead of querying for historical scans using
  754. a structured textual query (such as search for a hostname or an IP address),
  755. Visual Search uses an existing screenshot image as the query. Visual Search
  756. works similar to popular <em>Reverse Image-Search</em> engines like Google’s <em>Search
  757. by Image</em> and the <em>TinEye Reverse Image Search</em>. Customers will be able to
  758. leverage Visual Search feature to discover previously undetected cases of brand
  759. impersonation or similar phishing pages based on the visual appearance of those
  760. sites.</p>
  761.  
  762. <h3 id="availability">Availability</h3>
  763. <p>Visual Search is available today through the urlscan Pro portal. The feature is
  764. currently in Beta until its API is finalized over the next few weeks. Further
  765. information about Visual Search is available to customers on the urlscan Pro
  766. platform.</p>
  767.  
  768. <h3 id="urlscan-pro-trial">urlscan Pro Trial</h3>
  769.  
  770. <p>If you would like to take <strong>urlscan Pro</strong> for a spin just reach out to
  771. <a href="mailto:sales@urlscan.io">sales@urlscan.io</a>. We offer 30-day free trials with
  772. no strings attached.</p>]]></content><author><name>Johannes Gilger</name></author><category term="changelog" /><category term="product" /><summary type="html"><![CDATA[Today we are launching Visual Search which is a powerful new search feature available through our urlscan Pro - Threat Hunting platform. Use-Cases Visual Search allows users to find historical scans with visually similar screenshots to a scan of interest. This type of feature is also called Content-Based Image Retrieval. Instead of querying for historical scans using a structured textual query (such as search for a hostname or an IP address), Visual Search uses an existing screenshot image as the query. Visual Search works similar to popular Reverse Image-Search engines like Google’s Search by Image and the TinEye Reverse Image Search. Customers will be able to leverage Visual Search feature to discover previously undetected cases of brand impersonation or similar phishing pages based on the visual appearance of those sites. Availability Visual Search is available today through the urlscan Pro portal. The feature is currently in Beta until its API is finalized over the next few weeks. Further information about Visual Search is available to customers on the urlscan Pro platform. urlscan Pro Trial If you would like to take urlscan Pro for a spin just reach out to sales@urlscan.io. We offer 30-day free trials with no strings attached.]]></summary></entry><entry><title type="html">Search: New searchable attributes</title><link href="https://urlscan.io/blog/2022/04/21/search-new-searchable-attributes/" rel="alternate" type="text/html" title="Search: New searchable attributes" /><published>2022-04-21T00:00:00+02:00</published><updated>2022-04-21T00:00:00+02:00</updated><id>https://urlscan.io/blog/2022/04/21/search-new-searchable-attributes</id><content type="html" xml:base="https://urlscan.io/blog/2022/04/21/search-new-searchable-attributes/"><![CDATA[<p>Today we are launching a major overhaul to our search index powering our
  773. urlscan.io and urlscan Pro platforms. This release will offer new functionality
  774. to community and paid users. We have gathered customer feedback and
  775. internal use-cases and came up with a list of additional attributes that would
  776. be helpful to search on. This post outlines the highlights of new available
  777. search attributes. All of the new searchable fields have been
  778. integrated in a <strong>backward compatible</strong> fashion, which means that any search
  779. which previously worked on urlscan.io will continue to work.</p>
  780.  
  781. <p><strong>The full list of searchable fields is available on the <a href="/docs/search/">Search API
  782. Reference</a> page.</strong></p>
  783.  
  784. <!--more-->
  785.  
  786. <h3 id="new-searchable-attributes">New searchable attributes</h3>
  787.  
  788. <p>Over the past two years we recognized various additional fields that we would
  789. like to be able to search.  The <strong>title of the page</strong> was an obvious addition,
  790. and we have also gone ahead and added fields like the <strong>age of the TLS
  791. certificate at the time the page was scanned</strong> or the <strong>Cisco Umbrella rank for
  792. the primary hostname of the page</strong>. We hope that these new fields will allow
  793. hunting for more interesting scans on urlscan.io.</p>
  794.  
  795. <h3 id="wildcard-search">Wildcard Search</h3>
  796.  
  797. <p>We have changed the way that certain fields are indexed so that these fields
  798. can more effectively be searched using regular expressions and wildcard
  799. expressions. Especially for fields containing arbitrary information like URLs
  800. or hostname it is often crucial to search using complex expressions, something
  801. which was hard or outright impossible to do before. With the wildcard fields,
  802. users can search for single characters in a field like the page url very
  803. quickly.</p>
  804.  
  805. <h3 id="content-search">Content Search</h3>
  806.  
  807. <p>Customers on our <em>Professional</em> and <em>Enterprise</em> plans can now find historical
  808. scans by searching for strings in the <strong>text of the website</strong>. We currently
  809. index the first 20kB of visible text content per site.</p>
  810.  
  811. <p>Customers can also search for structured information from the page, such as the
  812. names and types of <strong>input fields</strong>, the name of <strong>global JavaScript variables</strong>,
  813. and the <strong>detected technologies</strong> employed by a website.</p>
  814.  
  815. <p>Outgoing links from the website are now indexed by <strong>domain</strong> and <strong>full URL</strong>.
  816. These fields can be searched to incoming links from other websites.</p>
  817.  
  818. <h3 id="verdicts--brand-search">Verdicts &amp; Brand Search</h3>
  819.  
  820. <p>We have always allowed customers on our <em>Professional</em> and <em>Enterprise</em> plans
  821. to search for detections of malicious websites on our platform by means of our
  822. brand detection system. Now we also incorporate <strong>community verdicts</strong> into our
  823. search index and combine them with our verdicts to form a global verdict and
  824. score. These attributes are grouped under the <em>verdicts</em> key.</p>
  825.  
  826. <h3 id="whats-next">What’s next?</h3>
  827.  
  828. <p>As next steps we will integrate pivoting via the additional attributes to
  829. urlscan.io and our <strong>urlscan Pro</strong> threat hunting platform. Changes to these
  830. platforms will be announce on this blog and on the urlscan Pro platform.</p>
  831.  
  832. <p>You can reach out to us with any questions via
  833. <a href="mailto:support@urlscan.io">support@urlscan.io</a>.</p>
  834.  
  835. <p><strong>Editor’s Note</strong>: An earlier version of this blog-post was erroneously published
  836. early, we apologise for any confusion this might have caused!</p>]]></content><author><name>Johannes Gilger</name></author><category term="changelog" /><category term="product" /><category term="api" /><summary type="html"><![CDATA[Today we are launching a major overhaul to our search index powering our urlscan.io and urlscan Pro platforms. This release will offer new functionality to community and paid users. We have gathered customer feedback and internal use-cases and came up with a list of additional attributes that would be helpful to search on. This post outlines the highlights of new available search attributes. All of the new searchable fields have been integrated in a backward compatible fashion, which means that any search which previously worked on urlscan.io will continue to work. The full list of searchable fields is available on the Search API Reference page.]]></summary></entry></feed>

If you would like to create a banner that links to this page (i.e. this validation result), do the following:

  1. Download the "valid Atom 1.0" banner.

  2. Upload the image to your own server. (This step is important. Please do not link directly to the image on this server.)

  3. Add this HTML to your page (change the image src attribute if necessary):

If you would like to create a text link instead, here is the URL you can use:

http://www.feedvalidator.org/check.cgi?url=https%3A//urlscan.io/blog/feed.xml

Copyright © 2002-9 Sam Ruby, Mark Pilgrim, Joseph Walton, and Phil Ringnalda