Software Engineering

Botnets Malicious Software

Author: Editorial Staff
Posted on: 14 Oct 2019

Botnets are a network of malicious software that is headed by a botmaster from where they get all the instructions to execute. In this paper, a novel technique is proposed to infer malware network specifications given a sample of malware binary, which is a file consisting of instructions. (bin file is with .bin execution). Botmasters use Command-and-Control protocols to control malware-infected hosts for executing malicious activities. Every family of malware has its own set of instructions called fingerprints, which they follow to execute malicious activities. In the paper, they have tried to extract the fingerprint which acts as a unique identity for the malware family. They have employed reverse engineering in which they explore the messages and infer all the fields in them. But before applying the inference algorithm to the messages, they need to be decrypted since they are encrypted. For this purpose, they have proposed a system to extract C&C encryption keys by applying the dynamic analysis of the malware binary. So basically, they’re trying to extract the protocol specification from samples of malware communication binary and also decrypt it before applying the inference algorithm.

Since botnets can attack your computer and infect it through malicious activities, there is a need to devise a system that can detect suspicious activity and, through a series of steps, reverse the process. There is much work already done in the botnet domain, but the state-of-the-art techniques do not cater to the encrypted botnet protocols, i.e. while communicating, the botmaster and bots can send messages to one another in encrypted form. In order to unveil such C&C protocols, firstly, the messages should be captured and decrypted through the reverse-engineering protocols and type inference information. In order to infer the information of encryption, state-of-the-art techniques are enhanced to do binary analysis.

The key way to solve the issue is through reverse engineering. The first part of reverse engineering comprises the decryption of the traffic using dynamic analysis so that the keys can be extracted from the malware binary. Then, the second part consists of the automatic derivation of the specifications of the protocol by using type inference information over the traffic that has been decrypted. There are different types of malware families, each having its own signatures and protocols. The message format used is that of a malware family, ZeroAccess. By message, payload (executable file) is meant that be downloaded over the infected computer. Once the messages are accessed, encryption analysis is performed that filters out the candidates that may be behaving like encryption functions. After analyzing the output and input of network system calls, candidates are further filtered out. Then the static or derived nature of the encryption key so that decryption can be performed. Once the decryption protocol is learned, any message from the family of ZeroAccess can be decrypted.

Once the encryption analysis is done, the next step is protocol analysis. Different message types serve different purposes. For the sake of easiness, clustering of messages can be performed based on the message type. The message is then split into content, non-content, and magic fields. The magic field is the field that holds constant value for all the messages. Some other field types are also extracted like EXE field, dependent fields, composite field types, etc. Then, by using sequence alignment, each field is reconciled to form a single specification of the protocol.

The biggest motivation behind doing all the above steps, i.e. decryption of messages and extracting protocol specifications, is that the network signatures can be generated.

The overall proposition stated in the paper presents a good solution to infer the protocol of C&C being used in the malicious network of botnets. It also helps to alleviate the task of manually understanding malicious communications by providing detailed specifications of the protocol.

Cite This Work

To export a reference to this article please select a referencing stye below:

Editorial Staff

Academic Master Education Team is a group of academic editors and subject specialists responsible for producing structured, research-backed essays across multiple disciplines. Each article is developed following Academic Master’s Editorial Policy and supported by credible academic references. The team ensures clarity, citation accuracy, and adherence to ethical academic writing standards

Content reviewed under Academic Master Editorial Policy.