Sie sind auf Seite 1von 3

Malicious Url is detected by using 3 major features:We score url on the set of features.

The urls matching maximum no of feature score will be classified as malicious .

Lexical Features
These features allow us to capture the property that malicious URLs tend to look different from benign URLs. For example, the appearance of the token .com in the URL www.ebay.com is not unusual. However, the appearance of .com in www.ebay.com.phishy.biz or phish.biz/www.ebay.com/ index.php could indicate an attempt by criminals to spoof the domain name of a legitimate commercial Web site. Similarly, we would like to capture the fact that there are certain red flag keywords that tend to appear in malicious URLs e.g., ebayisapi would appear frequently in the path of URLs attempting to spoof an eBay page.

Source code features


JavaScript exploits are typically obfuscated to prevent detection by automated or manual analysis. Attackers typically use special characters to encode script, either as direct ASCII or transformed by some simple character-to-character function: document.write(unescape(%3C%68%74%6D%6C%20 %6C%61%6E%67%3D%22%65%6E%22%20%6... Thus, we can use the ratio of special character subsequences (nonEnglish for en websites) to script length.

Network features

We have only captured payload size i.e size of the webscript . Executable can be arbitrarily long, and obfuscated script may add to payload size as well.

Forecasting Sales using Data Mining


A data mining algorithm is a set of heuristics and calculations that creates a data mining model from data. To create a model, the algorithm first analyzes the data you provide, looking for specific types of patterns or trends. The algorithm uses the results of this analysis to define the optimal parameters for creating the mining model. These parameters are then applied across the entire data set to extract actionable patterns and detailed statistics. The mining model that an algorithm creates from your data can take various forms, including: A set of clusters that describe how the cases in a dataset are related. A decision tree that predicts an outcome, and describes how different criteria affect that outcome. A mathematical model that forecasts sales. A set of rules that describe how products are grouped together in a transaction, and the probabilities that products are purchased together.

We can Association rule mining for sales forecasting.Where forecasting will be done by By analyzing the most frequent items present in the dataset. We can also use time series analysis for forecasting .As we analyze the previous years sales patterns for analyzing.

Das könnte Ihnen auch gefallen