Spider Mastermind
Register | User Profile | Member List | F.A.Q | Privacy Policy | New Blog | Search Forums | Forums Home
Doomworld Forums : Powered by vBulletin version 2.2.5 Doomworld Forums > Classic Doom > Doom General > -control
 
Author
All times are GMT. The time now is 05:04. Post New Thread    Post A Reply
printz
CRAZY DUMB ZEALOT


Posts: 6844
Registered: 06-06


I saw this neat little release from doomworld/newstuff: http://www.doomworld.com/idgames/?id=16596

As it says, it allows you to control the player from an outside application. This would be very interesting to make bots without modifying the Doom source code. But if you move like that, do you get feedback from the Doom world? Or is the controller all blind like that?

Old Post 01-13-12 10:45 #
printz is offline Profile || Blog || PM || Search || Add Buddy IP || Edit/Delete || Quote
Maes
I like big butts!


Posts: 8662
Registered: 07-06



printz said:
But if you move like that, do you get feedback from the Doom world? Or is the controller all blind like that?


Obvious: you build a bot with full Artificial Vision and Artificial Intelligence sophisticated enough to understand Doom's visuals, and do all that over the Internet while at the same time sharing the experience on Facebook, Twitter and other Social Media.

Nothing that your average web developer can't do for morning exercises.

Old Post 01-13-12 11:20 #
Maes is offline Profile || Blog || PM || Homepage || Search || Add Buddy IP || Edit/Delete || Quote
fraggle
Super Moderator


Posts: 5999
Registered: 07-00


Glad you like it! See my earlier thread as well.

Old Post 01-13-12 11:31 #
fraggle is offline Profile || Blog || PM || Email || Homepage || Search || Add Buddy IP || Edit/Delete || Quote
printz
CRAZY DUMB ZEALOT


Posts: 6844
Registered: 06-06



Maes said:


Obvious: you build a bot with full Artificial Vision and Artificial Intelligence sophisticated enough to understand Doom's visuals, and do all that over the Internet while at the same time sharing the experience on Facebook, Twitter and other Social Media.

Nothing that your average web developer can't do for morning exercises.

No, that's too ambitious and would require too much fuzzy logic. It's hard to tell a computer what's bad and what's good.

Old Post 01-13-12 12:42 #
printz is offline Profile || Blog || PM || Search || Add Buddy IP || Edit/Delete || Quote
Maes
I like big butts!


Posts: 8662
Registered: 07-06



printz said:
No, that's too ambitious and would require too much fuzzy logic. It's hard to tell a computer what's bad and what's good.


My engineer's rigid mentality also tells me that this is probably a task worthy of a (cooperative) Ph.D. or subsidized research, but the world is full of 15-16 whiz kids that can apparently churn a top-selling iPhone app out of the blue over a weekend with no effort, so I learned not to exclude anything in the IT field

Old Post 01-13-12 13:32 #
Maes is offline Profile || Blog || PM || Homepage || Search || Add Buddy IP || Edit/Delete || Quote
Creaphis
I will deliberately take a contrary position just for the sake of writing incredibly long arguments


Posts: 3991
Registered: 10-05


Even without modifying the source code you could modify the .wad graphics to simplify automatic recognition.

Old Post 01-13-12 15:59 #
Creaphis is offline Profile || Blog || PM || Email || Search || Add Buddy IP || Edit/Delete || Quote
fraggle
Super Moderator


Posts: 5999
Registered: 07-00


If a computer can play Jeopardy, it can surely play Doom.

Old Post 01-13-12 16:12 #
fraggle is offline Profile || Blog || PM || Email || Homepage || Search || Add Buddy IP || Edit/Delete || Quote
chungy
Doomworld is so about bullshit excuses


Posts: 1458
Registered: 06-05


Well of course it's not impossible, but Jeopardy and Doom are two entirely different beasts. And even in the case of Watson, it was fed the answers via plain text as they were revealed; there wasn't any kind of perfected speech recognition or OCR going on during it. Granted it's still impressive that Watson had exactly the same time frame to think about the answers as real human players have. Of course I just completely overshadowed the massive technical feat of language recognition Watson pulled off; Jeopardy requires a very good level of comprehension and ability to see through subtle puns.

Doom would require quite extensive analysis and interpretation of the audio and visual output, and I'm sure it would be even more difficult using the original 320x200 display rather than running it at 1920x1200 or whatever high resolution... this is of course assuming I read the topic right and you're not allowed to just write the bot in the engine or otherwise give it a clear understanding of the game state, but instead to give it exactly the same output to decide game play that human players have.

Old Post 01-13-12 17:06 #
chungy is online now Profile || Blog || PM || Email || Search || Add Buddy IP || Edit/Delete || Quote
Mr. T
Senior Member


Posts: 1270
Registered: 12-03


A Watson-like brute force approach could work. Instead of giving it access to encylopedia upon encyclopedia of information, you would give it demos of every online match ever played. Instead of finding Jeopardy answers, it will find how to win in any given situation.

How's that?

Old Post 01-13-12 17:24 #
Mr. T is online now Profile || Blog || PM || Homepage || Search || Add Buddy IP || Edit/Delete || Quote
Maes
I like big butts!


Posts: 8662
Registered: 07-06


Well yeah after you got through the daunting task of making sense out of pure A/V input and create a world map (and its status) out of it, you'd be at the point of a conventional bot :-p

So even if you had perfect interpretation of the map (just as an in-engine bot would), it would still be a long shot towards seeing any kind of worthwhile gameplay, as all current limitations of bots would still apply (and then some, because the A/V bot would not have a way to "cheat" its way out of stupidity by e.g. knowing where the exit or the pickups are beforehand).

If you want the state of the art in this field, search for robotics teams that try to guide Lego Mindstorms contraptions in very simple mazes using brightly colored balls and walls as cues.

Or maybe something like this:



If there is ANYTHING in the world even remotely more advanced than this, it must surely be classified and developed by the military.

Old Post 01-13-12 17:24 #
Maes is offline Profile || Blog || PM || Homepage || Search || Add Buddy IP || Edit/Delete || Quote
qoncept
Junior Member


Posts: 154
Registered: 12-11



chungy said:
And even in the case of Watson, it was fed the answers via plain text as they were revealed; there wasn't any kind of perfected speech recognition or OCR going on during it.

I thought that was BS. With all the programming and and resources that super computer had, it would be trivial to add a webcam and make it play the same game as a human. I really think it should have had an actuator push the button to ring in, too.

Old Post 01-13-12 18:59 #
qoncept is offline Profile || Blog || PM || Search || Add Buddy IP || Edit/Delete || Quote
printz
CRAZY DUMB ZEALOT


Posts: 6844
Registered: 06-06


Let's be clear that I'm not interested in complicated stuff like that. I'd rather find a way (if the processor or virtual machine lets me) to acquire data from Doom (actors, ammo, coordinates... probably everything P_). Does statdump.exe, -statcopy or something derivative help?

Old Post 01-13-12 19:51 #
printz is offline Profile || Blog || PM || Search || Add Buddy IP || Edit/Delete || Quote
Maes
I like big butts!


Posts: 8662
Registered: 07-06


@printz: nope. Statcopy is simply a bunch of cumulative statistics. If you want 100% precision and direct access to the engine data, you need to run whatever code you want as a "plug-in" of sorts, or modify a port to be able to supply this information on-demand via an API.

I think this had been covered very extensively in a recent thread, too. The subject was different, but the core problem is exactly the same (and it's only the beginning of trouble).

Handicapping everything with a fuzzy A/V acquisition layer is something better left to someone with a few million dollars of budget, and to people willing to devote 2-3 academic years (aka base their degrees on it).

Old Post 01-13-12 19:56 #
Maes is offline Profile || Blog || PM || Homepage || Search || Add Buddy IP || Edit/Delete || Quote
chungy
Doomworld is so about bullshit excuses


Posts: 1458
Registered: 06-05



qoncept said:
I thought that was BS. With all the programming and and resources that super computer had, it would be trivial to add a webcam and make it play the same game as a human. I really think it should have had an actuator push the button to ring in, too.

Nope; see https://en.wikipedia.org/wiki/Watso...er%29#Operation

Speech recognition or even character recognition were beyond the scope of the project. Quite simply, adding either one (or both) of them would be adding more complications and chances for error on the part of Watson and wouldn't be an effective demonstration or experiment.

Old Post 01-13-12 21:50 #
chungy is online now Profile || Blog || PM || Email || Search || Add Buddy IP || Edit/Delete || Quote
printz
CRAZY DUMB ZEALOT


Posts: 6844
Registered: 06-06



Maes said:

I think this had been covered very extensively in a recent thread, too. The subject was different, but the core problem is exactly the same (and it's only the beginning of trouble).

I'm very serious about that. I may not have the time to work on it or may be distracted by other hobbies, but I find it perfectly possible to pass through a -nomonsters map.

Old Post 01-13-12 22:21 #
printz is offline Profile || Blog || PM || Search || Add Buddy IP || Edit/Delete || Quote
Maes
I like big butts!


Posts: 8662
Registered: 07-06



printz said:
I'm very serious about that. I may not have the time to work on it or may be distracted by other hobbies, but I find it perfectly possible to pass through a -nomonsters map.


I do too, and furthermore I believe it's possible using only inverse goal analysis (top-to-bottom, e.g. from the end switch to the door next to it and so on), plain directed graph navigation plus some heuristics and randomized behavior here and there, in order to eventually navigate to a particular spot and avoiding unpassable/unescapable traps.

Old Post 01-13-12 22:39 #
Maes is offline Profile || Blog || PM || Homepage || Search || Add Buddy IP || Edit/Delete || Quote
fraggle
Super Moderator


Posts: 5999
Registered: 07-00



chungy said:

Nope; see https://en.wikipedia.org/wiki/Watso...er%29#Operation

Speech recognition or even character recognition were beyond the scope of the project. Quite simply, adding either one (or both) of them would be adding more complications and chances for error on the part of Watson and wouldn't be an effective demonstration or experiment.

I was slightly disappointed at first as well, because I thought they had solved it in such a way that it did "the whole loop" (so to speak) including voice and character recognition.

But when I thought about it some more, it was quite honestly impressive enough in the things that it did manage to achieve. Plus it wouldn't really have adding anything to the demonstration. As the saying goes, do one thing and do it well.

Old Post 01-13-12 23:26 #
fraggle is offline Profile || Blog || PM || Email || Homepage || Search || Add Buddy IP || Edit/Delete || Quote
Quasar
Moderator


Posts: 4615
Registered: 08-00


Besides, other companies are working on that stuff (speech recognition and computer vision).

Only when AI doesn't require a brain the size of a Ryder truck will adding vision and hearing to it really have useful consequences, and then you'll see these technologies merge at that point.

IBM's current commercial goal for Watson is to have it answer directly input questions where it has a lot of time to think and come up with alternatives. It needs neither sight nor hearing to do this :P

Old Post 01-16-12 04:31 #
Quasar is offline Profile || Blog || PM || Email || Homepage || Search || Add Buddy IP || Edit/Delete || Quote
Maes
I like big butts!


Posts: 8662
Registered: 07-06


Plus, as I recently read somewhere, there's no such thing as "perfect" vision or speech recognition, even for humans: the human eye can be fooled by trompe l'œil, and it's very common in everyday life not to understand someone because he's using some dialectal expression or his accent is slightly off.

It's not even possible to analyze plain text just based on the literal meaning of words without a context knowledge/some way of giving "street smarts" to the AI, if popular expressions, idioms or coded words/slang is used.

If speech recognition was ever used in a mission-critical setting, it would require operator training in order to use only unambiguous expressions and a particular bureaucratic-like tone, not unlike military orders, and "conversations" would have to be one way at a time (signaling the end of a sentence with an "over" like in CW radio or military comms). There's a reason why even humans use this form in such settings.

If you forfeit these requirements, you simply end up with meaningless chit-chat and a bunch of jokes about how "computers are so stupid, they can't understand even a simple thing".

Old Post 01-16-12 09:32 #
Maes is offline Profile || Blog || PM || Homepage || Search || Add Buddy IP || Edit/Delete || Quote
qoncept
Junior Member


Posts: 154
Registered: 12-11



chungy said:

Nope; see https://en.wikipedia.org/wiki/Watso...er%29#Operation

Speech recognition or even character recognition were beyond the scope of the project. Quite simply, adding either one (or both) of them would be adding more complications and chances for error on the part of Watson and wouldn't be an effective demonstration or experiment.


Then it was just a stupid decision. If you're going to showcase a computer that can play the same game as a human, changing the rules of the game is asinine.

As I said, OCR should have been trivial to add, and if it put Watson at enough of a disadvantage that it couldn't compete, go back to work.

Old Post 01-16-12 19:18 #
qoncept is offline Profile || Blog || PM || Search || Add Buddy IP || Edit/Delete || Quote
Maes
I like big butts!


Posts: 8662
Registered: 07-06



qoncept said:
As I said, OCR should have been trivial to add,


Sure, but would it be any good? What degree of reliability would it need to have to be "human grade"? 95%? 99%? 99.999%? What if it failed and lost time trying to decipher or misunderstood a question?

And would it need to use nothing more than standard TV cameras attached to a robotic head with focus no different than the human eye (aka no zooming)? Should it be optimized just for jeopardy's displays or more general purpose to avoid any and all criticism?

And BTW, is Jeopardy playable by on-screen cues alone, or are both hearing and vision required to make sense of it?

As you can see it would be a whole other set of problems to cope with, which would clearly cross the boundary of the AI they were trying to showcase here, and walk straight into robotics realm. Two different problems, two different worlds, two different tasks.

That being said, it would've been more awesome if they presented a human-like robot (even a non-walking square box) that could pack all the necessary processing power (including visual tracking and speech recognition) in a human-sized package, with at least as good performance in following the game as a human competitor. Maybe in 10 years from now.

Old Post 01-16-12 19:34 #
Maes is offline Profile || Blog || PM || Homepage || Search || Add Buddy IP || Edit/Delete || Quote
qoncept
Junior Member


Posts: 154
Registered: 12-11



Maes said:
[B]
Sure, but would it be any good? What degree of reliability would it need to have to be "human grade"? 95%? 99%? 99.999%? What if it failed and lost time trying to decipher or misunderstood a question?

That's why they play the games. I can't answer that, because they didn't make Watson play by the same rules.


And would it need to use nothing more than standard TV cameras attached to a robotic head with focus no different than the human eye (aka no zooming)? Should it be optimized just for jeopardy's displays or more general purpose to avoid any and all criticism?

I think you should be able to wheel the thing in and make it play the game. Really, there shouldn't be any special consideration that require any explanation to the audience. Network cable, power, interface to supercomputer in the back, fine. But the way Watson received questions had to be explicitly described.

Feeding text files to Watson would completely make sense if dictation and OCR software never existed. But they do, so I expect them to go in to making a computer that plays on a trivia gameshow.

Old Post 01-16-12 22:34 #
qoncept is offline Profile || Blog || PM || Search || Add Buddy IP || Edit/Delete || Quote
Maes
I like big butts!


Posts: 8662
Registered: 07-06



qoncept said:
I think you should be able to wheel the thing in and make it play the game.


Which it did. So you could say that they achieved their end, even though the means were less than glamorous.


qoncept said:
Feeding text files to Watson would completely make sense if dictation and OCR software never existed. But they do


If they had a 100% accurate OCR that can work with the same conditions as the human eye and a "dictation software" that is at least as infallible as a man, that would be an achievement by itself, and would be touted in technological expos all over the world.

Only that the current "state of the art" is still mediocre and laughable, and would -pardon the pun- jeopardize the entire enterprise, if it was to be used.


qoncept said:
so I expect them to go in to making a computer that plays on a trivia gameshow.


Which they did. The "challenge" was to make a computer that could play jeopardy, not a ROBOT that overcomes still-standing voice and OCR problems.

Perhaps the best way to understand what was at play here is to sit back a moment and think how Deep Blue/Deep Thought and the best chess computers in general work: they are simply "thinking boxes" that think of nothing but the game of chess in a pure mathematical sense, and they are not build to concern themselves with e.g. visually scanning a chessboard or moving the pieces, nor do they need to in order to excel in what they do. Similarly, Watson doesn't need hearing and OCR in order to play Jeopardy.

It's a classic case of "I want my jetpack": people just expect too much without thinking of all the implications. Watson is not a fucking T-800, nor was it touted as one.

Old Post 01-16-12 22:51 #
Maes is offline Profile || Blog || PM || Homepage || Search || Add Buddy IP || Edit/Delete || Quote
chungy
Doomworld is so about bullshit excuses


Posts: 1458
Registered: 06-05


Watson wasn't even about playing Jeopardy, although it was an excellent way to test it out on what IBM was doing: trying to parse natural human language.

Actually, even implying that Jeopardy was the only purpose it was made for would seriously undermine the project. Think of robots (and as Maes mentioned earlier) that are designed to use cameras and navigate obstacle courses. A lot of them use courses out of LEGO or similar, they're easy bright color blocks to distinguish from, and programmers tend to get that down before getting more and more complicated and realistic terrain. the LEGO obstacle courses, or Watson playing Jeopardy for that matter, is just getting the technology's feet wet, so to speak, but it is by no means intended to be the end goal.

Old Post 01-16-12 23:49 #
chungy is online now Profile || Blog || PM || Email || Search || Add Buddy IP || Edit/Delete || Quote
All times are GMT. The time now is 05:04. Post New Thread    Post A Reply
 
Doomworld Forums : Powered by vBulletin version 2.2.5 Doomworld Forums > Classic Doom > Doom General > -control

Show Printable Version | Email this Page | Subscribe to this Thread

 

Forum Rules:
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts
HTML code is OFF
vB code is ON
Smilies are OFF
[IMG] code is ON
 

< Contact Us - Doomworld >

Powered by: vBulletin Version 2.2.5
Copyright ©2000, 2001, Jelsoft Enterprises Limited.

Forums Directory