HTML - Sitemap Spider Finding Links I No Longer Use
I redesigned my site and changed all my page names. Everything I put up is brand new, and nothing links to the old stuff. But when I generate a sitemap, it somehow includes the old pages, which makes me believe that something, somewhere, still links to one of the old pages, and the spider is following it to all the others.
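One way to trace a stray link like this is to crawl the site the same way the sitemap spider does and remember, for every URL found, which page linked to it. The following is only a minimal sketch using the Python standard library; the names `LinkParser`, `fetch_html` and `find_referrers` are made up for illustration, and it assumes the pages are plain HTML reachable from the start page.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag, urlsplit
from urllib.request import urlopen


class LinkParser(HTMLParser):
    """Collect the href of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)


def fetch_html(url):
    """Download a page; return None if it cannot be fetched."""
    try:
        with urlopen(url, timeout=10) as resp:
            return resp.read().decode("utf-8", errors="replace")
    except (OSError, ValueError):
        return None


def find_referrers(start_url, fetch=fetch_html):
    """Crawl one host breadth-first; map each URL to the pages linking to it."""
    host = urlsplit(start_url).netloc
    referrers = {}              # target URL -> set of pages that link to it
    seen = {start_url}
    queue = deque([start_url])
    while queue:
        page = queue.popleft()
        html = fetch(page)
        if html is None:
            continue
        parser = LinkParser()
        parser.feed(html)
        for href in parser.hrefs:
            # Resolve relative links and drop any #fragment.
            target = urldefrag(urljoin(page, href)).url
            if urlsplit(target).netloc != host:
                continue        # stay on the same site
            referrers.setdefault(target, set()).add(page)
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return referrers
```

Calling `find_referrers` with the home page URL and then looking up one of the old page URLs in the result shows which page still points at it. Fixing that page and re-running the crawl should show whether any other route to the old pages remains.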
Is there a site or a tool I can use to find what is linking to the old pages?

Similar Tutorials

This problem is simple in need, but a little complex to explain. What I am looking for is a way to create a flowchart or a sitemap of each HTML file rather than of my whole site. I need a program or application that will do this for me automatically so that I do not have to create it manually. I have already tried Visio, but it does not do what I need.

Specifically, I have multiple anchor links within each HTML file. These anchor links point to other 'panels' on my page. Within those 'panels' there may be additional links, external or internal. At some point, those links will end at a 'panel'. I need a way to visually show where each anchor link is going, where those links came from (which panel they originated from), and whether any of them are 'broken'.

I have attached two files that may explain this better. The flowchart_example.jpg shows exactly what I require (more or less). I really just need the boxes (labeled Panel 00, Panel 01, etc.), how each connects to the others based on the links in the HTML file, and eventually where the links end (no more links in the panel). I do not need the '1 Click' or '2 Clicks' labels to show up. Those can be added manually later. The an_example.html file shows an HTML file that has the anchor links and external links that would need to show up in the flowchart. This HTML file more or less mirrors what I am looking for in the flowchart_example.jpg file.

When I refer to Panel 00, that is the top of the page, after the title and black line (where the script actually begins). The user must start at the top. They cannot skip around to other panels. This is why a visual flowchart would be quite helpful. Additionally, the links at the top (scripts, reference, transfers, etc.), along with the top and back buttons on each panel, should be ignored when creating the flowchart.
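A panel-to-panel map like the one described can be generated as Graphviz DOT text and then rendered as a flowchart. The sketch below is only an illustration: since an_example.html is not shown, it assumes each panel starts at an `<a name="...">` anchor or at an element with an `id`, and the names `PanelLinks`, `to_dot` and the `ignore` parameter are made up here.

```python
from html.parser import HTMLParser


class PanelLinks(HTMLParser):
    """Record which panel each link sits in and where it points."""

    def __init__(self, top="Panel 00"):
        super().__init__()
        self.current = top        # panel we are currently inside
        self.panels = {top}       # every panel name seen so far
        self.edges = []           # (source panel, href) pairs

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        # Assumption: a panel starts at <a name="..."> or any element with an id.
        name = attrs.get("id") or (attrs.get("name") if tag == "a" else None)
        if name:
            self.current = name
            self.panels.add(name)
        href = attrs.get("href") if tag == "a" else None
        if href:
            self.edges.append((self.current, href))


def to_dot(html, ignore=()):
    """Return Graphviz DOT text; in-page links to missing anchors are red."""
    parser = PanelLinks()
    parser.feed(html)
    lines = ["digraph panels {"]
    for source, href in parser.edges:
        if href in ignore:
            continue              # e.g. the top and back buttons
        target = href[1:] if href.startswith("#") else href
        broken = href.startswith("#") and target not in parser.panels
        style = ' [color="red", label="broken"]' if broken else ""
        lines.append(f'  "{source}" -> "{target}"{style};')
    lines.append("}")
    return "\n".join(lines)
```

Saving the returned text to a file and running it through Graphviz (for example `dot -Tpng panels.dot -o panels.png`) draws one box per panel with an arrow per link. Links that should not appear, such as the navigation at the top, can be passed in `ignore`.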
Any information or assistance in finding an application or a way to accomplish this would be greatly appreciated. Thanks in advance.

I have many pages of XHTML code, and on those pages there are dozens of links to different files. Sometimes these files change, and it is very hard to find all the places that link to a certain file. Is there a program or something else out there that could be used? Preferably a free one, but it does not have to be. Thanks. Could you please only recommend a program that you have used or have at least heard of from someone. This program would also have to work for offline pages that are stored on my hard drive.

I was wondering if there is a simple method or a program that can find all active links linking from a certain directory. For example, say a website like http://www.website.com/index.html has 15 active links and 5 broken links. I want to be able to view a list of all the active links on the page without actually clicking every link on the page. Is that possible? If so, how?

Hi, I have two test versions of my site:
1: http://www.monkeygambling.com/new%20...indexnew.shtml
2: http://www.monkeygambling.com/new%20...0working.shtml
However, the search engine spider simulator at http://www.webconfs.com/search-engin...-simulator.php doesn't see any text on (2). Can anyone see why? Thanks.

Today when I was doing some work with Google's webmaster tools, it said I didn't have a sitemap. So I researched and made a sitemap in XML and HTML, but what do I do with these files? Also, when it sitemapped my site, why didn't it map everything? My site is a photography-related site. I use Exposure Manager to display and sell my images, but the sitemapping didn't see these pages even though they're under my domain name. Or is it because the domain name changes slightly? Instead of photographersnature.com, the EM pages are at http://photographersnature.exposuremanager.com.
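On the last point, the sitemap protocol treats a subdomain as a different host, and sitemap generators commonly stop at the host they started on. This small sketch only illustrates that comparison; `same_host` is a made-up helper, and whether the generator used here applies exactly this rule is an assumption.

```python
from urllib.parse import urlsplit


def same_host(site_url, page_url):
    """True when both URLs share exactly the same hostname."""
    return urlsplit(site_url).hostname == urlsplit(page_url).hostname


main_site = "http://www.photographersnature.com/"
gallery = "http://photographersnature.exposuremanager.com"

# about.html is a made-up page name used only for the comparison.
print(same_host(main_site, main_site + "about.html"))
print(same_host(main_site, gallery))
```

The first comparison matches and the second does not, which would explain why the Exposure Manager pages were left out. Pages on that host would need their own sitemap, served from that host.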
Thanks,
Dave
www.PhotographersNature.com

Not sure if this should be here or in the SEO forum, but regardless... Currently I've got a sitemap for REMG; since the entire site is created dynamically, I need this for SEO purposes. Everything is sweet except for one thing: the '&' symbol tends to cause errors when bots go crawling through. So I added <![CDATA[ ]]> to my URLs, yet I still seem to get a dodgy error from Yahoo Slurp and some other no-name bot.

XML
This is only a small part of it; I can't be bothered pasting the entire thing.
Code:
<?xml version="1.0" encoding="UTF-8" ?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"
        xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
        xsi:schemaLocation="http://www.sitemaps.org/schemas/sitemap/0.9 http://www.sitemaps.org/schemas/sitemap/0.9/sitemap.xsd">
  <url>
    <loc><![CDATA[http://www.remg.com.au/index.cfm?fuseaction=home.content&cat=Corporate&sub=Services]]></loc>
    <changefreq>daily</changefreq>
    <priority>0.5</priority>
  </url>
  <url>
    <loc><![CDATA[http://www.remg.com.au/index.cfm?fuseaction=home.content&cat=Campaign Management]]></loc>
    <changefreq>daily</changefreq>
    <priority>0.5</priority>
  </url>

Error Email
We receive this email if the site generates errors. Sorry, I can't paste all of it.
Code:
struct
CAT              Corporate⊂=Services
FUSEACTION       home.content
NAME             Corporate Profile Development
HTTP_USER_AGENT  NextGenSearchBot 1 (for information visit http://www.zoominfo.com/About/misc/NextGenSearchBot.aspx)
QUERY_STRING     fuseaction=home.content&cat=Corporate%E2%8A%82=Services&name=Corporate%20Profile%20Development

So yeah, if you look at the query string, it's all gibberish: the bot has turned "&sub" into the "⊂" character. The actual URL is: http://remg.com.au/index.cfm?fuseact...%20Development

Here is what you get if you search REMG on Yahoo...
# Real Estate Media Group - Vendor=Request Quote
http://www.remg.com.au/index.cfm?fus...%8A%82=Request...
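For reference, the sitemap protocol asks for URLs to be percent-encoded and then entity-escaped (so "&" is written as "&amp;") rather than wrapped in CDATA. The sketch below shows one way to build such a <loc> element with the Python standard library; `sitemap_loc` is a made-up helper, and whether this alone stops every bot from misreading "&sub" is not certain. Renaming the `sub` parameter may be a more robust fix.

```python
from urllib.parse import quote, urlsplit, urlunsplit
from xml.sax.saxutils import escape


def sitemap_loc(url):
    """Return a <loc> element with the URL percent-encoded, then XML-escaped."""
    parts = urlsplit(url)
    # Percent-encode unsafe characters (such as spaces) but keep the
    # separators that give the path and query string their structure.
    path = quote(parts.path, safe="/%")
    query = quote(parts.query, safe="=&%")
    encoded = urlunsplit((parts.scheme, parts.netloc, path, query, parts.fragment))
    # escape() turns & into &amp; (and < > into &lt; &gt;).
    return "<loc>" + escape(encoded) + "</loc>"


print(sitemap_loc(
    "http://www.remg.com.au/index.cfm?fuseaction=home.content&cat=Campaign Management"
))
```

With this approach the space in "Campaign Management" becomes %20 and each "&" is written as "&amp;", so an XML parser reading the sitemap recovers the original URL.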
Obviously, if you try to click the link, it goes to the error page, since there's no content for cat=Vendor⊂=Request+Quote. Anyone got any ideas? Or am I stuck in a rut?

Hello all members, how do I create an easy HTML sitemap? Please share your suggestions here.

Hey, I have been creating websites for some time, and when adding font styles I have usually assigned font-weight:normal; in the CSS, which in previous versions of Firefox and IE led the text to look exactly the same. Now with IE 7 and Firefox 3 I am noticing differences in font weight using the same technique, so I researched it a little and found that font-weight no longer behaves the same way in Firefox 3. Is there a workaround to get fonts looking the same? Normal content text is not really the problem. When I create headings with a font size of, say, 26px, they appear really heavy, and I want to set the font weight to normal or lighter to make them thinner, but it doesn't work. I hope there is a workaround, as I don't want to go back to the stone age and use images for text to get sites cross-browser friendly. Any advice?

When I first put up my holiday page, someone informed me that they couldn't see the bottom of the page, even if they scrolled down. I increased the height of the iframe to 1200 and it fixed the problem, but it increased the iframe height on all the pages. Is there a way to do something like a floating iframe? Make sense?
<iframe name="iframe" width=950 height=1200 hspace=25 src="content.html" scrolling="auto" frameborder=1 ></iframe>
Link to site (then click on Holiday greeting for the page in question): http://www.girlsincsoaz.org
TIA

Hi all, I have encountered a problem in my code: when I went back to format my page, I could no longer use the submit button. I was able to find that it occurred when I added the <table> tags within the selector function, but for the life of me I cannot figure out why that makes the form invalid, or whatever is happening.
If anyone could have a look and see what the issue is, it would be greatly appreciated.
Quote:
<?php
session_start();
if ($_SESSION['groupid'] < 1) {
    header("location:login.php?message=You do not have sufficient " .
           "permissions to view that page. Please login.");
}
?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<?php
$page = "CapSim";
include("common/init.php");
?>
</head>
<body>
<?php include("header.php");?>
<br/>
<br/>
<div id="page">
<?php
include("dbconnect.php");
// Formatting
$content = "<h3 align='center'>".$page."</h3>\n" .
           "<center>\n"
           ."<h3 align='center'></h3>";
$content .= "<h4>" . "Number of Tiers in the model" . "</h4>";
$TIER=$_GET["TIER"];
$content .= "<form action='capsim_home.php' method='get' target='_blank'>\n";
$content .= "<input type='text' name='TIER' value='$TIER' onKeyPress='return numbersonly(this, event)' />\n ";
$content .= "<input type='submit'value='New Layer'>\n";
$content .= "</form>";
$content .= "<br/>" . "</center>\n" . "<br/><hr width='900px'/><br/>";
// Set up the form and hidden control variables
$content .= "<form action='/cgi-bin/".$parentdir."/capsimController.py' method='post' target='_blank'>";
// Default values for fields
$u = 10;
$x = 15;
$m = 16;
$q = 1;
$S_RT = 10000;
$a_a = 2;
$a_u = 3;
$a_l= 1;
$runs = 1;
$o_i=100;
$default_vals = 1;
//Function for printing off the layers within the tiers

Please take a look at http://csacalc.com/ and go to "OR FOR A SHORT TIME ONLY, GET A FREE REPORT". As you can see, that brings up an email form which the CSA victim fills in and emails to me, and at the end of the line he gets a report back from me.
I did about 1,000 of these over the last 4 years, but recently [since Vista] I only get emails from Firefox etc., whereas 95% USED to come from IE. When I checked, I found that this form "defaults" in Vista to the email program inside MS Office, and if I say NO WAY I WILL PAY FOR THAT AGAIN [seeing I bought Office in 1990 for $1,000] and tell it to send via Windows Mail [or whatever it is now called], ALL the data is lost. So obviously I need to revise the code that used to work. Could someone please explain the most painless way to do this? Edit: in fact, it seems we now have a new Windows on top of Vista/XP, so are there even MORE new problems with this "upgrade"?

So every time I add a picture or text, my website page gets "longer". I then move the text or picture with the "div" tag so I can position it where I like. But the webpage is still the same length as it was before I moved the object. So how can I stop the page from getting longer every time I add text or a picture to it? Thanks.

Hi, I am new to web design and need your guidance. I want to break a word if it is longer than the table width. My English may not be good, so I'll explain using an example. Following is my code.
Code:
<html>
<body>
<table align="justify" width="100" border="3">
<tr>
<td> TE-sdf-sdf-hkjd </td>
</tr>
</table>
</body>
</html>
As you can see here, the word "TE-sdf-sdf-hkjd" is longer than 100px, so the table becomes wider than 100px. I want to break the word and fix the width of the table at 100px. How can I achieve that? Waiting for your reply.

Hi everyone, I hope I have posted this in the right area. This is my first time posting here. I currently have my main pages listed at the top of my site as graphics (see fussybaby.ca). I would like to potentially remove these graphics and use text links instead. I had the graphics designed for me, so I can't just go in and make new ones; however, I'd like to change and rearrange the links and add some new ones. So my question is: how hard is this to do?
Is this something I need to hire someone to do for me? Thanks in advance for any advice!

My sticky footer now floats in the middle of the page after I declared a doctype. Any suggestions for a quick fix?
doctype:
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html>
HTML Code:
<body>
<div id="wrap">
  <div id="header"></div>
  <div id="logoimg"></div>
  <div id="logotext"></div>
  <div id="searchbox">
    <img src="" align="absmiddle" vspace="2px"/>
    <label for="searchfield">search: area, position, area+position</label><br>
    <input type="text" name="searchfield" value="" id="searchfield"/>
    <div id="geist"></div>
  </div>
  <div id="main"></div>
</div>
<div id="footer"></div>
Code:
body {
  background-color: #99ccff;
  background-image: url(http://);
  background-repeat: repeat-x;
  overflow: auto;
  margin: 0;
  padding: 0;
  height: 100%;
}
#wrap{
  min-height: 100%;
  margin-top: -151px;
}
#footer {
  margin: auto;
  width: 100%;
  height: 100px; /* Height of the footer */
  background: #3e4041;
  border-top: 1px solid #e0e0e0;
  clear: both;
  margin-top: 50px;
}
/*Opera Fix*/
body:before {/* thanks to Maleika (Kohoutec)*/
  content:"";
  height:100%;
  float:left;
  width:0;
  margin-top:-32767px;/* thank you Erik J - negate effect of float*/
  margin-bottom: 51px;
}
h1, h2, p {
  padding:0 10px;
}
#outer:after {/* thank you Erik J - instead of using display table for ie8*/
  clear:both;
  display:block;
  height:1%;
  content:" ";
}

I'm having trouble finding a free website template for a game site. If anyone knows any websites, that would be a big help.

Here at ultdmovies.com, I need to find the width I made the whole thing, plus the width of the forum at ultdmovies.com/forum. Thanks!

I'm looking to see if there is a free template for my website, preferably red and white.
I have an outline of how I want the template to look at this link: http://www.freewebs.com/zxgamerreviews/test22.html
Note: the template would have to work in Kompozer. Thanks.

Hi, I am using servlets and JavaScript. When I upload a file, operations are delegated from the servlet based on the MIME type. For example, the MIME type of picture.gif is image/gif. The problem I am facing is that when I remove the file extension of any file, say renaming picture.gif to picture, the MIME type becomes application/octet-stream. How can I detect the MIME type of a file given the file stream, or is there any other solution to this problem? Thanks.

I'm trying to find the style sheet for this webpage: http://www.opportunity.com/index.html. Any idea how? Thx guys! Matt
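A page's style sheets are normally referenced from `<link rel="stylesheet">` tags in its source, alongside any inline `<style>` blocks, so viewing the source is usually enough. The sketch below does the same lookup with the Python standard library; `StylesheetFinder` and `find_stylesheets` are made-up names, and the sample markup is invented rather than taken from the page in question.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class StylesheetFinder(HTMLParser):
    """Collect linked style sheets and count inline <style> blocks."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.sheets = []          # absolute URLs of linked style sheets
        self.inline_blocks = 0    # number of <style> elements in the page

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        rel = (attrs.get("rel") or "").lower().split()
        if tag == "link" and "stylesheet" in rel and attrs.get("href"):
            # Resolve relative paths against the page's own URL.
            self.sheets.append(urljoin(self.base_url, attrs["href"]))
        elif tag == "style":
            self.inline_blocks += 1


def find_stylesheets(html, base_url):
    """Return (list of style sheet URLs, number of inline <style> blocks)."""
    finder = StylesheetFinder(base_url)
    finder.feed(html)
    return finder.sheets, finder.inline_blocks


# Invented sample markup; css/main.css is a hypothetical path.
sample = '<head><link rel="stylesheet" href="css/main.css" /><style>p{}</style></head>'
print(find_stylesheets(sample, "http://www.opportunity.com/index.html"))
```

Feeding it the real page source (saved from the browser or downloaded) lists the style sheet URLs to open directly. Style sheets pulled in through `@import` inside another sheet would not be found by this sketch.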