What is website classification?
Website classification is increasingly used by organizations looking to streamline and optimize their operations. Combined with artificial intelligence, it has a real impact on the security of data flow, as well as on organizational management. One example is the BTC Website Classification solution.
How does BTC Website Classification work?
BTC Website Classification is a fast and effective process for analyzing and categorizing each page visited by a user. By monitoring employee activity, the classifier determines whether the actions taken by employees are effective and productive. What’s more, through detailed security analysis, it provides automatic detection of potential threats, thus helping to safeguard the company against intentional or accidental data leakage.
BTC’s website classifier uses two methods to analyze URLs. The first, machine learning, analyzes content based on keywords. The second, deep learning, analyzes the entire page and is able to understand the context, thus providing even more accurate classification.
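The keyword-based method can be illustrated with a minimal sketch. The category names, keyword lists, and the function `classify_by_keywords` below are hypothetical and chosen only for illustration. They do not describe BTC's actual model, which is not detailed in this article.

```python
import re
from collections import Counter

# Hypothetical keyword lists per category, for illustration only.
CATEGORY_KEYWORDS = {
    "computers and software": {"software", "laptop", "driver", "download"},
    "shopping": {"cart", "checkout", "discount", "price"},
    "sports and entertainment": {"match", "league", "movie", "score"},
}


def classify_by_keywords(text):
    """Return the category whose keywords occur most often, or None."""
    words = re.findall(r"[a-z]+", text.lower())
    counts = Counter()
    for word in words:
        for category, keywords in CATEGORY_KEYWORDS.items():
            if word in keywords:
                counts[category] += 1
    if not counts:
        # No keyword matched, so no category can be assigned.
        return None
    return counts.most_common(1)[0][0]
```

A keyword counter like this ignores context, which is the limitation the deep learning method is meant to address.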
In BTC Website Classification, the entire classification process is fully automatic. The classifier first checks whether the page contains text data on which it can base the categorization. With the help of artificial intelligence, it then determines the category assigned to the page and a confidence factor for that choice. A high confidence factor means the algorithm has, with high probability, identified the category the page actually refers to. The classifier also indicates whether the analyzed URL is productive.
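The flow described above (text check, category, confidence factor, productivity flag) might look roughly like the sketch below. The threshold `HIGH_CONFIDENCE`, the set `PRODUCTIVE_CATEGORIES`, and the `model` callable are all assumptions made for this example. The article does not state what value counts as high confidence or how productivity is decided.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple

# Assumed cut-off; the article does not say what counts as "high" confidence.
HIGH_CONFIDENCE = 0.8

# Hypothetical set of categories treated as productive.
PRODUCTIVE_CATEGORIES = {"internet services", "computers and software"}


@dataclass
class ClassificationResult:
    category: str
    confidence: float
    is_reliable: bool
    is_productive: bool


def classify_page(
    text: str, model: Callable[[str], Tuple[str, float]]
) -> Optional[ClassificationResult]:
    """Classify page text, or return None when there is no text to analyze."""
    if not text.strip():
        return None
    # The model is any callable returning (category, confidence in [0, 1]).
    category, confidence = model(text)
    return ClassificationResult(
        category=category,
        confidence=confidence,
        is_reliable=confidence >= HIGH_CONFIDENCE,
        is_productive=category in PRODUCTIVE_CATEGORIES,
    )
```

Passing the model in as a callable keeps the sketch independent of whether the category comes from the keyword-based or the deep learning method.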
The classifier also performs a detailed security analysis. Among other things, it checks the structure of the site, the presence of an SSL certificate, whether the page contains redirects, and, particularly important for security, whether the site appears in databases and registries of dangerous sites. Finally, the classifier presents two categories, one from each of the two technologies, together with a thorough security analysis and a recommendation along with the certainty of the result.
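As a rough illustration of what checking a URL's structure and blocklist status could involve, the sketch below applies a few common heuristics using only the Python standard library. The specific rules, the `BLOCKLIST` contents, and the function `structure_warnings` are assumptions for this example, not BTC's actual checks. Validating an SSL certificate and following redirects require network requests and are left out.

```python
import ipaddress
from urllib.parse import urlsplit

# Hypothetical blocklist standing in for databases of dangerous sites.
BLOCKLIST = {"malware.example"}


def structure_warnings(url):
    """Return a list of warnings about the URL, based on offline heuristics."""
    parts = urlsplit(url)
    host = parts.hostname or ""
    warnings = []
    if parts.scheme != "https":
        warnings.append("no HTTPS")
    try:
        # Raises ValueError unless the host is a literal IP address.
        ipaddress.ip_address(host)
        warnings.append("IP address used as host")
    except ValueError:
        pass
    if parts.username is not None:
        warnings.append("credentials embedded in URL")
    if host.count(".") > 3:
        warnings.append("many subdomains")
    if host in BLOCKLIST:
        warnings.append("host on blocklist")
    return warnings
```

An empty list means none of these heuristics fired. It does not prove that the site is safe.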
BTC Website Classification is a solution designed to optimize and automate key activities. Administrators gain detailed information about the websites visited by employees, including security and productivity issues, in one place.

TOP WWW by BTC Website Classification
Organizations that want to guarantee security while automating operations and key processes should rely on proven, transparent and, most importantly, innovative solutions. That’s exactly what BTC’s Website Classification solution offers: it analyzes hundreds of new URLs every day, categorizes them, and checks them for security and productivity.
Below we list the most popular categories and websites by number of queries. Security is key, so we also looked at what percentage of the sites visited in the last month could be potentially dangerous.*

Internet services and communications was the most searched category in August. Nearly half of all URL queries, more than 45%, were for sites on this topic.
Queries for sites about computers, software and electronics accounted for more than 12% of the total.
Sites about politics, law and government institutions appeared less often, at 8% of the total, and sites about shopping and advertisements at 6%.
Sites in the remaining categories (industry and business, media and news, tourism, and sports and entertainment) accounted for less than 5%.
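Shares like these can be derived from a log of classified queries. The sketch below assumes the log is simply a list of category labels, one per query. The function name `category_shares` and the sample data in the usage note are made up for illustration and are not the August statistics.

```python
from collections import Counter


def category_shares(categories):
    """Return each category's share of all queries, in percent, largest first."""
    counts = Counter(categories)
    total = sum(counts.values())
    return {
        category: round(100 * n / total, 1)
        for category, n in counts.most_common()
    }
```

For example, a log with five queries for "internet services", three for "computers", and two for "shopping" gives shares of 50.0, 30.0, and 20.0 percent respectively.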

Are the sites you visit safe?
In terms of security, we analyzed how many of the websites users visit have an outdated SSL certificate, how many of them contain redirects to other sites, and how many of them have an unsafe structure.
As many as 12% of the websites visited by system users have an invalid SSL certificate, and 19% of the analyzed sites have an unsafe structure. Redirects are rare: only 0.1% of the visited URLs contain them.
The results for the most popular categories, websites, and security issues are encouraging. Employees most often visit sites related to Internet services. Entertainment sites, which distract users from their work, account for only a few percent of visits.
*Statistics cover the period from 01.08.2021 to 31.08.2021.