Page 1 of 2 12 LastLast
Results 1 to 10 of 12

Thread: Attention Serious Programmers!

  1. #1
    I want to write my first web-bot. Could anyone help me out here by pointing to the right website or a tutorial or just some tips as how to create a bot.
    What language is the best to write one?

    I am going to write a smiple bot that will not do much but only search for a "screen name" over all the web and will send me back the websites where the screen name is found. This is just for fun. Later I want to make a search engine bot...

    Help me out here!

    Thanks
    I.am
    <span style='color:black'> I am a part of all that I have met - Lord Tennyson</span>
    <span style='color:blue'>Try not to let your mind wander...it is too small and fragile to be out by itself</span>

  2. Software & Hardware   -   #2

  3. Software & Hardware   -   #3
    chalkmongoose
    Guest
    Look up information on the following using Google:
    Content indexing, with meta-tags for optimal results
    XMLHTTP parsing structure, and DOM structure
    XMLHTTP component stability, WinHTTP component stability
    Differences between XMLHTTP and WinHTTP parsing structure
    Multi-domain channeling
    Cross-referencing using stored indexes
    Regular Expressions, with cursory Linux background

    These are basically what you&#39;d need to do this. First of all, I&#39;d make a decision as to whether to use XMLHTTP and WinHTTP, as both have advantages.

  4. Software & Hardware   -   #4
    Thanks to both of you&#33; If you have used to create one then you guys could probably recommend which is the best one out of the two.
    Any recommendations???
    <span style='color:black'> I am a part of all that I have met - Lord Tennyson</span>
    <span style='color:blue'>Try not to let your mind wander...it is too small and fragile to be out by itself</span>

  5. Software & Hardware   -   #5
    What would be the easiest web-bot to create? So its not so complex to do it and i learn something while i make it?
    <span style='color:black'> I am a part of all that I have met - Lord Tennyson</span>
    <span style='color:blue'>Try not to let your mind wander...it is too small and fragile to be out by itself</span>

  6. Software & Hardware   -   #6
    What exactly is a web bot?

    I&#39;m creating a Diablo II bot for Pindleskin.

  7. Software & Hardware   -   #7
    I highly reccommend using C++ as the language of implementation for your bot.

    This is currently your best choice, as it gives you the flexibility to design th actual program any way you want, and can interface with your OSes API natively (without using some form of high level abstraction). Not only will you be able to write better code, but it will run faster, be easier to debug and if you write it well, it will be very easy to port to other computers.

    As for the design itself, please clarify what exactly you want your bot to do?
    From what I can understand, you want your bot to be able to crawl the internet or an intranet and gather screen-names. If this is the case, you will have a hard time programming algorithmic heuristics to identify whether a given text contains a screen-name, especially given the nature of screen-names of most people.

    Therefore, I suggest you implement a simple neural network which you can train to detect and discern the nature of screen-names. To this end, I suggest using a the Hopfiled neral network topology, with feed-forward links, and backpropogation for learning. You can then create a set of pages with known results, and train it to detect the screen-names. Obviously you will need to train it with a lot of pages (about 100) to get a good accuracy, and the more you train it with, the more it will learn and detect more accurately.

    If you want, I can provide you with more information, and sample implementations on this.

    For complete accuracy (as good as an intelligent human) you will have to implement fuzzy logic, and create fuzzy sets which the neural network can use to train itself with, thereby automating the entire process. Obviously,you would have to implement some form of cooperation and competition, with a genetic algorithm.

    Hope that helps,

    Monica

  8. Software & Hardware   -   #8
    Thanks Monica, that was exactly I was trying to find about.

    If you want, I can provide you with more information, and sample implementations on this.
    Yes, it would be really nice if you could do that.

    As for the bot, I understand what you are saying. In Php forms there must be a page something like members.php, what if i train the bot only to search within the php pages and also more only within mem???.php.

    Right now, I have no skills as such regarding creating one. All I have is logic, and I believe if you have the right logic you can program in anything.
    I would really appreciate if you could help me out in this.
    Thanks,
    I.am
    <span style='color:black'> I am a part of all that I have met - Lord Tennyson</span>
    <span style='color:blue'>Try not to let your mind wander...it is too small and fragile to be out by itself</span>

  9. Software & Hardware   -   #9
    Ya, any of the languages in Visual Studio .NET would be good since there are a ton of added functions for the web and that...

  10. Software & Hardware   -   #10
    Firstly, contrary to the advice given by many of the other replies to this post, I would recommend that you avoid using any .NET based language, or even Visual Studio at all for that matter. For those about to flame me, this is not born out of a personal dislike for the MS environment, but rather out of experience. While MS&#39;s libraries are great for little utility programs, their limitations soon become apparent when you try to handle large or non-trivial data. Additionally, its is a very bad idea to come to depend on those libraries as it will limit the scope of your program and consequently it will limit you too. Its a far better idea, and much more beneficial to yourself (as a learning experience) to write your own implementation, relying on nothing but the API provided by your OS (which can be abstracted from your code easily, ensuring maximum portability).

    Having said that, I would like to state that using the MS libraries is a good idea if you are on a deadline and the limitations are not as big a concern as your release date.

    As to the code itself, I highly recommend the following books:
    • Neural Networks: A Comprehensive Foundation (2nd Edition) - by Simon S. Haykin
    • Practical Neural Network Recipes in C++ - by Timothy Masters
    • Neural Smithing: Supervised Learning in Feedforward Artificial Neural Networks - by Russell D. Reed, Robert J. Marks II
    • Introduction to Fuzzy Sets, Fuzzy Logic, and Fuzzy Control Systems - by Trung Tat Pham, Guanrong Chen
    • Neuro-Fuzzy and Soft Computing: A Computational Approach to Learning and Machine Intelligence - by Jyh-Shing Roger Jang, Chuen-Tsai Sun, Eiji Mizutani
    • Genetic Algorithms and Fuzzy Multiobjective Optimization - by Masatoshi Sakawa

    And for sample implementations, I recommend the following (from simplest, to most complex):
    Feel free to contact me for any more information.

    I cannot be more specific to your problem without knowing more about the objective of the bot and the scale of the implementation.

    Monica

Page 1 of 2 12 LastLast

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •