Get Text from a Web Page
This sample program shows how to get text from a Web page document that's open in Reflection 2011. To do this, we get a Web control object to access the Web page document and then use the XPATH to identify the Web element (text) on the page, as follows:
IWebElement WebElement = WebDocument.GetElement("HTML/BODY/TABLE/TBODY/TR/TD")
You can also use the ID to work with Web elements. For an example, see Integrate Web and Host Data.
To create a Web page document
- Using a text editor, such as Notepad, create a new text file called table.htm.
- Copy this HTML Table Code, paste it into the text file, and then save the file.
In Reflection 2011, create a new Web page document and type the file path to table.htm in the URL box. For example, if you saved the file to
type the following path (in one continuous string):
- Name the Web page document Webtable.urlx and save it to the same location as "table.htm."
- Quit, and then restart Reflection 2011.
- In Visual Studio, create a new Console Application project. In the New Project dialog box, type Get a Web Object in the Name box.
- Select the Create directory for solution box, click Browse to select your API projects folder (for example, C:\MyProjects), and then click OK.
- In Solution Explorer, add the class assemblies. (For instructions, see Create an API Project.)
- Copy the code from Get a Web Object, paste it to the Program.cs tab, and then edit the path in the code to match the path to which you saved Webtable.urlx.
- Build the application (Build > Rebuild Solution) and then run it (Debug > Start Without Debugging).
A console box should appear with the text extracted from Webtable.urlx: "The currency conversion rate from Canadian to U.S. Dollar: [number]. Press any key to continue..."